IDENTIFICATION OF A NOVEL HUMAN
RECEPTOR TYROSINE KINASE GENE



FIELD OF THE INVENTION



This invention relates to the DNA sequence
encoding a novel human growth factor receptor which is
a type III receptor tyrosine kinase. The receptor is
referred to as Kinase insert Domain containing Receptor
(KDR) and binds specifically to the growth factor
vascular endothelial cell growth factor (VEGF). This
invention also relates to the amino acid sequence of
the receptor.

BACKGROUND OF THE INVENTION 

Growth factors are small molecules which
regulate normal cell growth and development through
interaction with cell surface receptors. The receptors
for a number of growth factors are referred to as
tyrosine kinases; that is, binding of growth factor to
the receptor stimulates an increased phosphorylation of
tyrosine amino acids within the receptor; this in turn
leads to cellular activation (Bibliography 1).

There is increasing evidence that genetic
alterations affecting the expression of receptor
tyrosine kinases (RTK) can contribute to the altered
cell growth associated with cancer. This conclusion is
supported by the frequent identification of RTK as
products of the oncogenes for many of the acutely
transforming retroviruses (e.g., 2,3,4) and the
overexpression of RTK in certain cancers (5). The
identification of a novel RTK may lead to a better
understanding of cell growth under both normal and
transforming circumstances.

The amino acid sequence in the catalytic
domain of all tyrosine kinases has been conserved (6).
Detailed analysis of the amino acid sequences within
the catalytic and noncatalytic domains of RTK indicates
the existence of distinct structural subtypes. One
group of RTK (designated type III) includes the c-kit
proto-oncogene and the receptors for platelet derived
growth factor (PDGF) and colony stimulating factor-1
(CSF-1).

The most unusual feature of this subtype is
that its catalytic (kinase) domain is interrupted by a
long insertion sequence of 12-102 amino acids (the
kinase insert domain). The two peptides constituting
the kinase domain are conserved between the receptors,
while the sequence of the kinase insert domain is
unique for each receptor.

Several approaches have been tried in order
to identify novel RTK, including low-stringency
screening of cDNA libraries with previously
characterized DNA probes (7). More recently, a
technique has been developed that is capable of greatly
facilitating the identification of novel genes for
which some sequence data are known. The polymerase
chain reaction (PCR) has been used to identify novel
members of several gene families including those of
guanine nucleotide regulatory proteins (8) and protein
phosphatases (9). PCR has been used to identify novel
tyrosine kinase genes (10), though the primers used in
that study were designed from DNA segments contained in
all tyrosine kinases, rather than being specifically
directed against RTK. It is a continuing goal to
identify receptors for growth factors.

The elucidation of the growth factors, as
well as their receptors, involved in regulating
endothelial cell function is critical for the
understanding of how new blood vessels are formed
(angiogenesis). Angiogenesis plays a significant role
in both normal and pathological events such as
embryogenesis, progression of ocular diseases, and
wound healing (11). In particular, angiogenesis is an
important process for the growth of tumors (11).
Angiogenesis is a complex process involving endothelial
cell proliferation, migration, and tissue infiltration.
These events are stimulated by growth factors which
either (i) act directly on endothelial cells (12,13),
or (ii) act indirectly by inducing host cells to
release specific endothelial cell growth factors (11).
One member of the first group is vascular endothelial
cell growth factor (VEGF), also known as vascular
permeability factor (14-16). Besides its angiogenic
activity, VEGF displays the physiological function of
increasing the permeability of capillary vessels to
different macromolecules (14).

SUMMARY OF THE INVENTION

The present invention relates to novel DNA
segments which together comprise a gene which encodes
type III RTK. The type III RTK encoded by the gene is
designated the KDR protein (which stands for Kinase
insert Domain containing Receptor). The KDR protein
binds specifically to the growth factor VEGF (vascular
endothelial cell growth factor).

The DNA segments are identified and isolated
through the use of PCR technology. The overall
strategy is summarized as follows:

PCR is used to amplify the DNA segments
corresponding to the kinase insert domains of type III
receptor tyrosine kinase genes in an endothelial cell
library designated HL10246 (Clontech Laboratories,
Inc., Palo Alto, CA). Degenerate oligonucleotide
primers are designed which are complementary to
conserved tyrosine kinase domains flanking the kinase
insert domains of known type III receptor tyrosine
kinases. These primers are used in the PCR procedure.
DNA probes, designed from the DNA sequence of the PCR
product, are then used to identify cDNA clones of the
receptor gene from the original cDNA library.

In particular, the present invention relates
to specific oligonucleotides which, when used as
primers for PCR, allow for the amplification of DNA
segments corresponding to the kinase insert domains of
type III RTK genes.

In a principal embodiment, the present
invention is directed to three overlapping DNA segments
(designated BTIII081.8, BTIII129.5 and BTIV169) which
comprise the entire coding region of this novel gene,
namely, 4,068 nucleotides extending to the 3' end.

These DNA segments are isolated from a human
endothelial cell cDNA library and together comprise
the gene coding for a novel type III receptor tyrosine
kinase. The human gene containing these DNA segments
is referred to hereinafter as KDR (which stands for
Kinase insert Domain containing Receptor) or,
alternatively, as kdp (which stands for Kinase insert
Domain containing Protein). The use of the term KDR is
intended to include any DNA segments which form the
human gene which encodes the novel type III RTK of this
application.

The DNA segments embodied in this invention
are isolated from human sources. The present invention
comprises DNA segments, and methods for using these DNA
segments, which allow for the identification of a
closely related gene in mouse DNA. The methods
developed in this invention can be readily used by
those skilled in the art for the identification and
isolation of closely-related homologues in other
species. Therefore, the present invention also
embodies all DNA segments from species other than human
which encode proteins having substantially the same
amino acid sequence as that encoded by the kdp gene.

The present invention further relates to
methods developed for the detection of mRNAs produced
as a result of transcription of the sense strands of
the DNA segments of this invention. Messenger RNA
prepared from bovine endothelial cells is used in
developing these methods. The ability to detect mRNA
for a novel RTK may ultimately have medical benefit,
especially in light of recent observations that the
mRNAs for certain RTKs are overexpressed in some cancers
(5).

The methods developed in the present
invention for detecting mRNA expressed by the kdp gene
can be readily used by those of ordinary skill in the
art for the detection of mRNA species related to the
kdp gene in any cell type and from any species. For
this reason, the present invention embodies all mRNA
segments which are the result of transcription of the
kdp gene.

The present invention relates to methods for
expression of the receptor protein, for example, in
CMT-3 cells of monkey kidney origin. The receptor
protein, portions thereof, and mutated forms of the
receptor protein may be expressed in many other cells
by those skilled in the art using methods similar to
those described in this application. For this reason,
the present invention embodies all proteins encoded by
the human KDR gene and proteins encoded by related
genes found in other species.

The present invention further relates to
methods for studying the interaction of VEGF with the
expressed KDR protein. Recent work in the literature
(17) indicates that VEGF is one member of a family of
related proteins, and the interaction of growth factors
similar to VEGF with the KDR protein can readily be
studied by those skilled in the art using methods
similar to those described in this application. These
methods can readily be modified to study the
interaction of candidate pharmaceuticals with the KDR
protein towards the goal of developing an antagonist or
agonist of VEGF action. For this reason, the present
invention embodies methods for studying the interaction
of VEGF and VEGF-related growth factors with the KDR
protein.



BRIEF DESCRIPTION OF THE DRAWINGS



Figure 1 depicts a schematic representation 
of three receptor tyrosine kinase subclasses (6) . KI 
is kinase insert domain; PTK is kinase domain; cys is 
cysteine rich region. 

Figure 2 depicts the two sets of primers used
for PCR (SEQ ID NO: 1 and 2). The nucleotide sequences
in appropriate regions of the four known type III
receptor tyrosine kinase cDNAs are aligned and
degenerate oligonucleotide primers are designed based
upon the consensus sequences.



WO 92/14748    PCT/US92/01300



Figure 3 depicts the amplification of the
kinase insert domains using PCR. DNA segments encoding
the kinase insert domains of type III receptor tyrosine
kinases are amplified by PCR. A sample (5 µl) is run
on a 1.0% agarose gel which is stained with ethidium
bromide. DNA size standards (123 bp ladder; Bethesda
Research Laboratories, Bethesda, MD) are run as well.

Figure 4 depicts the DNA sequence of the two
PCR products (Panel A: 363 bp segment derived from the
420 bp product (SEQ ID NO: 3); Panel B: 251 bp product
(SEQ ID NO: 4)). The two products are purified by
agarose gel electrophoresis, digested with SalI and
EcoRI, and cloned into the plasmid vector
pBlueScribe(+) (Stratagene; San Diego, CA). The 420 bp
PCR product is digested to 363 bp during this
procedure. The DNA sequences for the primers used in
the amplification are underlined.

Figure 5A depicts a computer assisted
comparison of the DNA sequence for the 363 bp DNA
segment derived from the 420 bp PCR product with the
sequence of a DNA segment of the PDGF receptor (SEQ ID
NO: 5) (18). A region of strong homology between the
363 bp segment derived from the 420 bp PCR product and
the PDGF receptor is contained in a box. Figure 5B
depicts a computer assisted comparison of the DNA
sequence for the 251 bp PCR product with the sequence
of a DNA segment of the FGF receptor (SEQ ID NO: 6)
(7).

Figure 6 depicts the strategy used for
sequencing the insert portions of clones BTIII081.8,
BTIII129.5 and BTIV169. The sequencing reaction uses
either synthetic oligonucleotides (represented by boxes
at the start of an arrow), or the M13 universal primer
(no box) to initiate the reaction. In some cases,
portions of these DNA segments are isolated using the
restriction enzymes indicated in the figure, and
subcloned back into the plasmid vector pUC118, so that
the M13 universal primer can be used. The position of
the stop codon in BTIII129.5 is indicated. The coding
portions of these DNA segments are shown at the bottom
of the figure. The relative positions of the 1)
membrane spanning portion, 2) kinase domains, and 3)
kinase insert domain are indicated. The position of
these structural features within the KDR derived DNA
segments is compared in relation to their position in
the PDGF-receptor ("PDGF-R").

Figure 7 depicts the DNA and predicted amino
acid sequence of KDR, plus the stop codon (nucleotides
1-4071 of SEQ ID NO. 7). The sequence of the DNA
segment amplified by PCR is underlined (nucleotides
2749-3105 of SEQ ID NO. 7). Cysteine residues in the
putative extracellular domain are circled. Potential
N-linked glycosylation sites are indicated by an
asterisk. The putative membrane spanning region is
enclosed in a box (nucleotides 2293-2367 of SEQ ID NO.
7).

Figure 8 depicts a hydropathy plot of the
predicted amino acid sequence for the KDR protein.

Figure 9 depicts a comparison of the
predicted amino acid sequence in the putative
intracellular portion of the KDR protein to the c-kit
proto-oncogene (SEQ ID NO: 8) (3), the CSF-1 receptor
(SEQ ID NO: 9) (4), and the PDGF receptor (SEQ ID NO:
10) (18). Exact matches are indicated by an asterisk.
Gaps are introduced to achieve maximum alignment. The
putative ATP recognition site is indicated by three
asterisks.

Figure 10 depicts the identification of kdp
receptor mRNA by Northern blot analysis. Five
micrograms of bovine aortic endothelial cell polyA+ RNA
are used. A nick-translated [32P]CTP-labelled
EcoRI/BamHI DNA segment (nucleotides 1510-2417 of SEQ
ID NO. 7) is used as a probe. Autoradiography is for
36 hours.

Figure 11 depicts the kdp gene in human and
mouse DNA by Southern blot analysis. A nick-translated
[32P]CTP-labelled EcoRI/BamHI DNA segment (nucleotides
1510-2417 of SEQ ID NO. 7) is used as the probe. The
probe is hybridized to Southern blots containing EcoRI
digested DNA from human (lane 1), mouse (lane 2), and
human-mouse hybrid cells (19) (lanes 3 and 4). The DNA
used in lane 3 lacks the kdp locus, while DNA used in
lane 4 contains the kdp locus.

Figure 12 depicts a Western blot analysis of
CMT-3 cells which express the KDR protein. Cells are
transfected with either the pcDNAltlqiASP vector alone
(lane 1) or with that vector modified to contain the
KDR gene (lane 2). 2 x 10^ cells and 1 microgram of
DNA are used for each transfection. Forty-eight hours
later, Western blot analysis is performed on the
samples using the anti-KDR.PS23 polyclonal antibody at
a dilution of 1:1000. Detection of reacting proteins
is performed using an ECL system (Amersham, Chicago,
IL).

Figure 13 depicts the results of [125I]VEGF
binding to CMT-3 cells which express the KDR protein.
Cells are transfected with either the vector alone
(bars 1 and 2) or with the vector containing the KDR
gene (bars 3 and 4). Forty-eight hours later, the
samples are washed with phosphate buffered saline
(PBS), and incubated with serum-free media containing
50 pM [125I]VEGF (specific activity equal to 4,000 cpm
per fmol), for 90 minutes. Nonradioactive VEGF, 5 nM,
is added to some samples (bars 2 and 4) to define
specific binding sites. The samples are washed with
ice cold PBS, and the cells are transferred to
counting tubes using 0.1% lubrol.

Figure 14 depicts the results of affinity
cross-linking of [125I]VEGF to CMT-3 cells which
express the KDR protein. CMT-3 cells are transfected
with either the vector alone (lane 1) or with the
vector containing the KDR gene (lane 2). Forty-eight
hours later, the cells are washed in PBS, and serum
free media containing 200 pM [125I]VEGF is added.
After 90 minutes at room temperature, an affinity
cross-linker, disuccinimidyl suberate, 0.5 mM, is added
for 15 minutes. The samples are then prepared for
SDS-PAGE autoradiography.

DETAILED DESCRIPTION OF THE INVENTION

The strategy used to discover the DNA
segments for the novel type III RTK gene begins with
the design of two degenerate oligonucleotide primers
based upon their homology to specific regions of the
kinase domains of known RTK genes (Fig. 2) (3,4,7,18).
In one embodiment, the polymerase chain reaction is
then used to amplify DNA segments from a human
endothelial cell cDNA library (designated HL10246).
The cDNA products from this step are each cloned into a
plasmid vector designated pBlueScribe+ (Stratagene, San
Diego, CA) and sequenced. Oligonucleotide probes are
designed from potentially interesting sequences in
order to screen the cDNA library for more full length
clones of the novel cDNA.

The strategy just described provides several
novel elements: 1) the DNA sequences of the
oligonucleotide primers used during PCR; 2) the DNA
sequence of the products generated by the polymerase
chain reaction; and 3) the DNA sequence of the final
cloned DNA segments. Each of these elements of the
invention described in this application will now be
discussed in detail.

Figure 2 shows the rationale for choosing the
oligonucleotide primers used in the PCR. The primers
are designed to allow for the PCR amplification of the
kinase insert domain of type III RTK genes. In order to
design the primers, the DNA sequences of known type III
RTK genes are aligned in specific regions of their
catalytic domains, and a consensus sequence is chosen.
The regions of the catalytic domains chosen in
designing the primers flank the kinase insert domains
of the receptor genes.

Primer 1 (SEQ ID NO: 1) is designed from a
region of the kinase domain 5' to the kinase insert
domain, and consists of a mixture of four different
21mers. Primer 2 (SEQ ID NO: 2) is designed from a
region of the kinase domain 3' to the kinase insert
domain, and consists of a mixture of sixteen different
29mers with one inosine, indicated in SEQ ID NO: 2 by
"N".

SalI and EcoRI restriction sites are included
at the 5' end of primers 1 and 2, respectively, to
facilitate the subcloning of the amplified PCR products
into plasmid vectors. Those skilled in the art may use
other restriction sites; other minor modifications in
the protocol above permit the design of primers
without the inclusion of restriction sites.
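
The way a degenerate primer resolves into a mixture of distinct oligonucleotides can be illustrated with a short sketch. The two example sequences below are hypothetical and are not SEQ ID NO: 1 or 2; they are chosen only so that the first begins with a SalI site and expands to four 21mers, and the second begins with an EcoRI site, carries one inosine, and expands to sixteen 29mers. Unlike SEQ ID NO: 2, which writes inosine as "N", the sketch writes inosine as "I" and reserves "N" for the standard four-base ambiguity code.

```python
from itertools import product

# Standard IUPAC nucleotide ambiguity codes. Inosine ("I") is treated as a
# single species because it is one synthetic base, not a mixture.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT",
    "K": "GT", "M": "AC", "B": "CGT", "D": "AGT",
    "H": "ACT", "V": "ACG", "N": "ACGT",
    "I": "I",
}


def degeneracy(primer):
    """Number of distinct oligonucleotides in the degenerate mixture."""
    count = 1
    for base in primer.upper():
        count *= len(IUPAC[base])
    return count


def expand(primer):
    """List every concrete sequence represented by the degenerate primer."""
    pools = [IUPAC[base] for base in primer.upper()]
    return ["".join(choice) for choice in product(*pools)]


# Hypothetical primers (assumptions for illustration only).
EXAMPLE_21MER = "GTCGACAARTGYATCCACAGA"          # SalI site + 2 two-fold positions
EXAMPLE_29MER = "GAATTCCCRTARCTCCAIACRTCYGAGTT"  # EcoRI site + 4 two-fold + 1 inosine

if __name__ == "__main__":
    for name, primer in [("21mer", EXAMPLE_21MER), ("29mer", EXAMPLE_29MER)]:
        print(name, len(primer), "nt,", degeneracy(primer), "species")
```

Each two-fold ambiguous position doubles the size of the mixture, which is why two such positions give four species and four give sixteen.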

The selection of these specific primers
constitutes a novel approach towards identifying novel
type III RTK genes. It had previously been shown (10)
that primers designed from DNA sequences common to all
tyrosine kinases allow for the identification of novel
proteins. The present invention is the first to
contemplate the use of PCR to specifically target type
III RTK.

The protocol used for PCR is as follows:
Human endothelial cell cDNA (designated HL10246) is
denatured by boiling and submitted to 30 cycles of PCR
using 1 nmol of both primers in a final volume of 100
µl. The timing is 1.5 minutes at 92°C, 2 minutes at
50°C, and 2 minutes at 74°C. DNA from 5 µl of sample is
separated on a 1% agarose gel and stained with ethidium
bromide.
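
The arithmetic implied by this protocol can be sketched as follows. The step names (denature, anneal, extend) are the conventional reading of the three temperatures and are an assumption, as is the neglect of ramp times between temperatures; the helper names are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Step:
    name: str        # conventional label (assumed, not stated in the protocol)
    temp_c: float    # temperature in degrees Celsius
    minutes: float   # hold time in minutes


# Temperatures and hold times as given in the protocol.
CYCLE = [
    Step("denature", 92, 1.5),
    Step("anneal", 50, 2.0),
    Step("extend", 74, 2.0),
]
N_CYCLES = 30


def total_minutes(cycle, n_cycles):
    """Total hold time, ignoring ramping between temperatures."""
    return n_cycles * sum(step.minutes for step in cycle)


def micromolar(nmol, volume_ul):
    """Concentration in micromolar: nmol/ul equals mmol/l, so scale by 1000."""
    return nmol / volume_ul * 1000.0


if __name__ == "__main__":
    print("hold time:", total_minutes(CYCLE, N_CYCLES), "min")
    print("each primer:", micromolar(1, 100), "uM")
```

Under these assumptions the thirty cycles amount to 165 minutes of hold time, and 1 nmol of a primer in 100 µl corresponds to 10 µM.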

Figure 3 shows the results of the PCR
amplification. Two DNA products, with sizes 251 bp
(SEQ ID NO: 4) and 420 bp, are visible when a sample of
the reaction is electrophoresed on a 1.0% agarose gel
and stained with ethidium bromide. The sizes of the
two products are within the range expected for type III
RTK genes (products derived from the FGF and PDGF
receptor genes, which have the smallest and largest
known kinase insert domains, would be 230 and 510 bp,
respectively (20, 21)).

The DNA from four contiguous lanes with
sizes ranging from 200 to 600 bp is electrophoresed
onto DEAE filter paper, eluted from the paper with
salt, and ethanol precipitated. The samples are
incubated with 5 units of EcoRI and SalI. The
restriction enzymes digest the 420 bp DNA segment to a
363 bp DNA segment (SEQ ID NO: 3), due to the presence
of an EcoRI site within the 420 bp DNA segment
(nucleotide 2749, SEQ ID NO. 7). The restriction
enzyme digested PCR products are then subcloned into
the plasmid vector pBlueScribe(+). The recombinant
clones are analyzed by sequencing using the
dideoxy method (22) using a United States Biochemical
(Cleveland, Ohio) Sequenase Version 2.0 sequencing kit.
Figure 4 shows the DNA sequences for the 251 bp PCR
product and the 363 bp DNA segment derived from the 420
bp PCR product.
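
The shortening of a PCR product by an internal restriction site can be illustrated with a minimal in-silico digest. The toy sequence below is hypothetical and much shorter than the 420 bp product; it merely carries a SalI site at one end, an EcoRI site at the other, and one internal EcoRI site. The sketch considers the top strand of a linear molecule only, using the well-known cut positions G^AATTC (EcoRI) and G^TCGAC (SalI).

```python
import re

# Recognition site and top-strand cut offset for each enzyme.
ENZYMES = {
    "EcoRI": ("GAATTC", 1),  # cuts G^AATTC
    "SalI": ("GTCGAC", 1),   # cuts G^TCGAC
}


def cut_positions(seq, enzymes):
    """Sorted top-strand cut coordinates for the named enzymes."""
    cuts = set()
    for name in enzymes:
        site, offset = ENZYMES[name]
        # Lookahead so that overlapping occurrences are not skipped.
        for match in re.finditer(f"(?={site})", seq):
            cuts.add(match.start() + offset)
    return sorted(cuts)


def fragment_lengths(seq, enzymes):
    """Lengths of the fragments of a linear molecule after complete digestion."""
    bounds = [0] + cut_positions(seq, enzymes) + [len(seq)]
    return [end - start for start, end in zip(bounds, bounds[1:])]


# Hypothetical toy amplicon: SalI end, filler, internal EcoRI, filler, EcoRI end.
TOY_AMPLICON = "GTCGAC" + "ACGT" * 10 + "GAATTC" + "ACGT" * 5 + "GAATTC"

if __name__ == "__main__":
    print(len(TOY_AMPLICON), "nt ->", fragment_lengths(TOY_AMPLICON, ["EcoRI", "SalI"]))
```

The largest fragment runs from the SalI end to the internal EcoRI site, so the cloned insert is shorter than the full amplicon, in the same way that the 420 bp product yields a 363 bp segment.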

Computer assisted comparison of the DNA
sequence for the 363 bp segment of the 420 bp PCR
product to databases of known DNA sequences reveals
that the sequence is novel: it shares strong
sequence identity with the flanking catalytic domains of
known type III RTK genes, but not with their kinase
insert domains. Figure 5A compares the DNA sequence for
the 363 bp DNA segment with that for the PDGF receptor
gene (SEQ ID NO: 5). Similar results are obtained using
other type III RTK genes.
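
A comparison of this kind reduces to counting identical positions in an alignment, region by region. The sketch below assumes two sequences that have already been aligned to equal length (gaps written as "-"); the toy sequences are hypothetical and are constructed so that the flanks match and the middle does not, mimicking conserved kinase sequence on either side of a unique kinase insert domain.

```python
def percent_identity(a, b, gap="-"):
    """Percent identical positions between two pre-aligned sequences.

    Columns in which either sequence has a gap are ignored.
    """
    if len(a) != len(b):
        raise ValueError("aligned sequences must have equal length")
    columns = [(x, y) for x, y in zip(a, b) if x != gap and y != gap]
    if not columns:
        return 0.0
    matches = sum(x == y for x, y in columns)
    return 100.0 * matches / len(columns)


def windowed_identity(a, b, window):
    """Percent identity in consecutive, non-overlapping windows."""
    return [
        percent_identity(a[i:i + window], b[i:i + window])
        for i in range(0, len(a), window)
    ]


# Hypothetical toy alignment: conserved flanks, divergent middle.
SEQ_NEW = "ACGTACGTAC" + "GGGGGGGGGG" + "TTGCATTGCA"
SEQ_KNOWN = "ACGTACGTAC" + "CATCATCATC" + "TTGCATTGCA"

if __name__ == "__main__":
    print(round(percent_identity(SEQ_NEW, SEQ_KNOWN), 1))
    print(windowed_identity(SEQ_NEW, SEQ_KNOWN, 10))
```

A high overall score can therefore hide a region with no similarity at all, which is why the comparison is read separately for the flanking and insert regions.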

DNA sequencing of the 251 bp PCR product
reveals a novel sequence containing both primers used
for the amplification, but the sequence shows little
homology to known tyrosine kinases. This is depicted
in Figure 5B, which compares the DNA sequence for the
251 bp DNA segment with that for the FGF receptor (SEQ
ID NO: 6). For this reason, further analysis of
the 251 bp product (Product 1) is not pursued.

The protocols used during the PCR do not
allow for amplification of the kinase insert domains of
known receptor tyrosine kinases in the endothelial cell
library used because of the low copy number of the
message present in the library. There have been many
studies on the effect of FGF on endothelial cell
function (23,24), although there is evidence that the
expression of the FGF receptor is developmentally
regulated (7), and it is likely that the library used
contains little or no cDNA for the FGF receptor.

An oligonucleotide probe, designed from the
DNA sequence of the 363 bp segment, is synthesized
(using an ABI 380 DNA Synthesizer) in order to screen
the human endothelial cell cDNA library (HL10246) for
the isolation of more full length clones containing the
363 bp DNA segment. The probe sequence is chosen from
the region of the 363 bp DNA segment which shares
little sequence homology with known RTK.

The screening of the endothelial cell cDNA
library is conducted as follows: Lambda gt11 phage,
10^, are adsorbed to E. coli LE392 for 15 minutes at
37°C prior to plating onto agar plates at a density of
5 x 10^ phage per plate. After allowing the phage
plaques to develop at 37°C, plaque lifts are made using
nitrocellulose filters, denatured in 0.4 N NaCl for 1
minute, and neutralized in 0.5 M Tris.HCl, pH 7.3, plus
1.5 M NaCl. The filters are washed with 2 x standard
saline citrate (SSC) and then baked for 1.5 hours in a
vacuum oven at 80°C. The filters are probed with a
[32P]ATP end-labeled synthetic oligonucleotide,
5'-TTTCCCTTGACGGAATCGTGCCCCTTTGGT-3', which is the reverse
complement of a DNA sequence contained in the PCR
amplified product (Fig. 3). Hybridization is performed
at 50°C in 5 x SSPE (167 mM NaCl, 10 mM sodium
phosphate, pH 7.4, 1 mM EDTA), 2.5 x Denhardts, 0.5%
sodium dodecyl sulfate (SDS), 100 µg/ml salmon sperm
DNA. The filters are washed twice, 20 minutes per
wash, with 2 x SSC plus 0.1% SDS at room temperature,
followed by washing twice at 50°C with 0.1 x SSC plus
0.1% SDS, 20 minutes per wash. Positive clones are
identified, picked and plaque purified.
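
Because the probe is stated to be the reverse complement of a sequence in the PCR product, the sense-strand sequence it detects can be recovered mechanically. The sketch below uses the probe sequence given above; the function and variable names are illustrative.

```python
# Translation table for Watson-Crick complements of the four DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of a DNA sequence written 5' to 3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


# Probe sequence as given in the screening protocol (5' to 3').
PROBE = "TTTCCCTTGACGGAATCGTGCCCCTTTGGT"

# Sequence on the opposite strand to which the probe anneals.
TARGET = reverse_complement(PROBE)

# Number of G or C bases in the probe.
GC_COUNT = sum(base in "GC" for base in PROBE)

if __name__ == "__main__":
    print("probe :", PROBE, f"({len(PROBE)} nt, {GC_COUNT} G/C)")
    print("target:", TARGET)
```

The probe is a 30mer with 16 G or C bases, and applying the function twice returns the original sequence.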

Forty-five positive clones are obtained.
Three of these positive clones are plaque purified and
their phage DNA isolated. Digestion of the DNA with
EcoRI and electrophoresis in agarose indicates that one
clone, designated BTIII081.8, contains the largest
insert, and subsequent analysis indicates that the DNA
insert of this clone overlaps that of the inserts
contained in the other two purified clones (designated
BTIII079.11 and BTIII079.47A).






Digestion of the purified phage DNA of the
clone designated BTIII081.8 with EcoRI results in DNA
segments of 250 bp, 600 bp, and 1000 bp. Each of these
three products is subcloned into the plasmid vector
pUC118 and sequenced (Figure 6 shows the strategy used
for sequencing). The orientation of the three
fragments is determined by subcloning from the insert a
BglII/BglII fragment into pUC118 and sequencing across
the EcoRI junctions using a synthetic oligonucleotide
to prime the sequencing reaction.

A restriction map is determined for each
fragment (Figure 6). Various restriction site pieces
are removed from the plasmids and recloned into pUC118
so that sequencing the resulting plasmids with the
universal primer allows for sequencing most of the
entire original fragments in both directions. Three
oligonucleotide primers are required to sequence the
entire cDNA in both directions. For the purposes of
this application, this insert contains nucleotides
numbered 1510-3406 (SEQ ID NO. 7).

A [32P]CTP-labelled, nick-translated
EcoRI-BamHI DNA segment derived from clone BTIII081.8
(nucleotides 1510-2417 of SEQ ID NO. 7) is used as a
probe to rescreen the original endothelial cell cDNA
library for more full length 5' DNA segments of the
gene from which the insert portion of BTIII081.8 is
derived. The protocols used to isolate the overlapping
clones are identical to those used to isolate
BTIII081.8.

A synthetic oligonucleotide probe is designed
with 29 nucleotides corresponding to part of the DNA
sequence of the insert portion of the clone BTIII081.8
(nucleotides 3297-3325 of SEQ ID NO. 7) in order to
rescreen the original endothelial cell cDNA library for
more full length 3' DNA segments of the gene from which
the insert portion of BTIII081.8 is derived. The
protocols used to isolate the overlapping clones are
identical to those used to isolate BTIII081.8. Several
positive clones for each of the 5' and 3' ends are
identified and plaque purified.

One of the clones is designated BTIII200.2.
The DNA from BTIII200.2 contains a 3.4 kb insert as
determined by EcoRI digestion of the isolated phage
DNA. EcoRI digestion of BTIII200.2 results in three
DNA fragments. One of these fragments (2.5 kb) is
cloned into pUC119 and is designated BTIV006. The
clone BTIV006 contains nucleotides numbered 7-2482. As
described below, BTIV006 plus nucleotides 1-6 is
designated BTIV169. DNA sequencing of the 2.5 kb DNA
insert (BTIV169) indicates that it overlaps over one
thousand nucleotides of the DNA sequence of the insert
portion of the clone BTIII081.8 (Figure 6) at the 5'
end.

A second clone isolated from the cDNA library
is designated BTIII129.5. The DNA from BTIII129.5
contains a 2.2 kb insert as determined by EcoRI
digestion of the isolated phage DNA. DNA sequencing of
the 2.2 kb DNA insert indicates that it overlaps over
five hundred nucleotides of the DNA sequence of the
insert portion of the clone BTIII081.8 (Figure 6). The
clone BTIII129.5 contains nucleotides numbered
2848-4236 (SEQ ID NO. 7). The DNA sequence for
BTIII129.5 contains the stop codon TAA, defining the
position of the 3' end of an open reading frame for the
novel gene. Except for the first six nucleotides of
the gene which are discussed below, these three clones
define a gene encoding a growth factor receptor. These
three clones define a 4,062 nucleotide sequence of the
open reading frame of the gene extending to the 3' end,
followed by a 168 nucleotide non-coding region (SEQ ID
NO. 7). A sample of a lambda gt11 phage harboring the
clone BTIII081.8 has been deposited with the American
Type Culture Collection, 12301 Parklawn Drive,
Rockville, Maryland 20852, U.S.A., and has been
assigned ATCC accession number 40,931. A sample of a
lambda gt11 phage harboring the clone BTIII129.5 has
been deposited with the American Type Culture
Collection and has been assigned ATCC accession number
40,975. For reasons discussed below, a sample of the
clone BTIV006 was not deposited.

The aforementioned DNA segments (BTIII081.8,
BTIII129.5, and BTIII200.2 (or BTIV006)) encode 4062
nucleotides of the coding portion of a novel gene. The
cDNA clones are incomplete in that a translation
initiation codon coding for methionine is missing. After
the isolation of these clones, Matthews et al. (25)
reported the cloning of a gene homologue of KDR in
mouse, which was referred to as flk-1. Analysis of the
nucleic acid and amino acid sequence of flk-1 indicated
that the addition of six nucleotides to the 5' end of
the isolated KDR clones would provide for a complete
coding region.

To achieve this, an EcoRI-BamHI restriction
fragment of BTIII200.2 is cloned into the plasmid
pBlueScript KS (Stratagene, La Jolla, CA). The 5' end
of the inserted DNA is blunt ended with Klenow
polymerase and Mung Bean nuclease. Next, the synthetic
oligonucleotide TCGACGCGCG ATG GAG (SEQ ID NO. 11) is
cloned into this vector. The oligonucleotide contains
the sequence ATG GAG in frame with the downstream DNA
insert. These nucleotides (ATG GAG) encode the amino
acids methionine and glutamic acid, the first two amino
acids encoded by the KDR gene. The resulting plasmid
vector is designated BTIV140. This plasmid is purified
on a CsCl gradient. The purified plasmid is designated
BTIV169. The insert of BTIV169 contains nucleotides
1-2400 (SEQ ID NO. 7) of the KDR gene. A sample of the
plasmid pBlueScript KS which contains the clone BTIV169
has been deposited with the American Type Culture
Collection and has been assigned ATCC accession number
75200.
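
That the synthetic oligonucleotide supplies an initiation codon in frame with the downstream insert can be checked with a minimal translation sketch. The oligonucleotide is taken from the text; the nine downstream nucleotides appended in the second example are hypothetical and do not come from SEQ ID NO. 7. The codon table is the standard genetic code.

```python
# Standard genetic code, built from the conventional TCAG-ordered string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna, start=0):
    """Translate from `start` in one-letter code, stopping at a stop codon."""
    protein = []
    for i in range(start, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Oligonucleotide from the text, written without spaces.
OLIGO = "TCGACGCGCGATGGAG"
START = OLIGO.find("ATG")  # position of the initiation codon

# Hypothetical downstream codons, used only to show frame continuity.
HYPOTHETICAL_DOWNSTREAM = "AGCAAGTAA"

if __name__ == "__main__":
    print(START, translate(OLIGO, START))
    print(translate(OLIGO + HYPOTHETICAL_DOWNSTREAM, START))
```

The first call yields Met-Glu; as long as the insert is joined directly after GAG, translation continues in the same frame into the downstream sequence.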

Thus, together the clones BTIII081.8,
BTIII129.5 and BTIV169 comprise the entire open reading
frame of 4,068 nucleotides for the novel KDR gene. As
will be discussed below, the KDR gene expresses the
novel KDR receptor which binds specifically to the
growth factor VEGF.
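
The nucleotide coordinates stated for the three clones can be checked for complete coverage with a short sketch. The coordinates are those given in this description (BTIV169, 1-2400; BTIII081.8, 1510-3406; BTIII129.5, 2848-4236, all in SEQ ID NO. 7); the helper names are illustrative.

```python
# Inclusive nucleotide coordinates in SEQ ID NO. 7, as stated in the text.
CLONES = {
    "BTIV169": (1, 2400),
    "BTIII081.8": (1510, 3406),
    "BTIII129.5": (2848, 4236),
}
ORF_LENGTH = 4068    # open reading frame, nucleotides
TOTAL_LENGTH = 4236  # last nucleotide covered by BTIII129.5


def overlap(a, b):
    """Number of nucleotides shared by two inclusive intervals."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


def covers(intervals, first, last):
    """True if the intervals tile first..last without a gap."""
    reach = first - 1
    for start, end in sorted(intervals):
        if start > reach + 1:
            return False
        reach = max(reach, end)
    return reach >= last


if __name__ == "__main__":
    print("gap-free:", covers(CLONES.values(), 1, TOTAL_LENGTH))
    print("5' overlap:", overlap(CLONES["BTIV169"], CLONES["BTIII081.8"]))
    print("3' overlap:", overlap(CLONES["BTIII081.8"], CLONES["BTIII129.5"]))
    print("codons:", ORF_LENGTH // 3, "non-coding:", TOTAL_LENGTH - ORF_LENGTH)
```

With these coordinates the clones cover nucleotides 1-4236 without a gap, the 4,068 nucleotide reading frame corresponds to 1,356 codons, and the remaining 168 nucleotides match the stated non-coding region.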

DNA sequencing of BTIII081.8, BTIII129.5 and
BTIV169 (SEQ ID NO. 7) shows that the newly isolated
gene is similar to, but distinct from, previously
identified type III RTK. The predicted amino acid
sequence (SEQ ID NO. 7) contains several structural
features which demonstrate that the novel gene is a
type III RTK. These structural features are summarized
as follows:

1) A hydropathy plot of the predicted amino
acid sequence indicates a single membrane spanning
region (see Figure 8). This is characteristic of a
type III RTK (Figure 7).

2) The putative amino-terminal 762 amino
acid portion of the receptor has structural features of
extracellular receptor ligand binding domains (1),
including regularly spaced cysteines and 18 potential
N-linked glycosylation sites (Figure 7).

3) The predicted amino acid sequence of the
carboxy-terminal 530 amino acid portion contains an
ATP-binding site at lysine 868, 22 amino acids
downstream from the consensus ATP recognition sequence
Gly-X-Gly-X-X-Gly (26) (Figure 9).

4) Within the kinase domain there is a
55-60% identical match in amino acid sequence to three
other type III receptor tyrosine kinases: the c-kit
proto-oncogene (SEQ ID NO: 8), the CSF-1 receptor (SEQ
ID NO: 9) and the PDGF receptor (SEQ ID NO: 10)
(Figure 9).

5) The predicted kinase domain contains a 
kinase insert domain of approximately 71 amino acids. 
As indicated in Figure 9, this portion of the amino 
acid sequence shares little sequence homology with 

10 other type III RTK. 
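
The ATP-binding feature in item 3 can be illustrated with a short sketch. It scans a protein sequence in one-letter code for the consensus Gly-X-Gly-X-X-Gly and reports the first lysine downstream of it. The function name, the `max_gap` cutoff, the convention of counting the gap from the last glycine of the motif, and the toy sequence are all assumptions made for illustration; they are not taken from the specification.

```python
import re

# Consensus ATP recognition sequence Gly-X-Gly-X-X-Gly in one-letter code.
ATP_MOTIF = re.compile(r"G.G..G")

def find_atp_site(protein, max_gap=30):
    """Return (motif_start, lysine_position, gap) in 1-based numbering, or None.

    gap counts residues from the last Gly of the motif to the first
    downstream Lys; max_gap is a hypothetical search window.
    """
    match = ATP_MOTIF.search(protein)
    if match is None:
        return None
    last_gly = match.end()  # 1-based position of the motif's final Gly
    offset = protein[match.end():match.end() + max_gap].find("K")
    if offset < 0:
        return None
    lysine = match.end() + offset + 1
    return match.start() + 1, lysine, lysine - last_gly

# Toy sequence (not the KDR sequence): motif at 5-10, lysine 22 residues later.
toy = "MAAAGSGAFG" + "T" * 21 + "KDD"
print(find_atp_site(toy))  # prints (5, 32, 22)
```

Run against the full predicted sequence of SEQ ID NO. 7, a scan of this kind would be expected to locate the motif upstream of lysine 868, although the exact counting convention behind the "22 amino acids" figure is not stated in the text.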

The endothelial cell library can be further
screened to isolate the 5' untranslated region and
genomic clones can be generated so as to isolate the
promoter region for the KDR gene.

In addition to the DNA sequence described for
the KDR gene (SEQ ID NO. 7), the present invention
further comprises DNA sequences which, by virtue of the
redundancy of the genetic code, are biologically
equivalent to the sequences which encode for the
receptor, that is, these other DNA sequences are
characterized by nucleotide sequences which differ from
those set forth herein, but which encode a receptor
having the same amino acid sequences as those encoded
by the DNA sequences set forth herein.
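
As a minimal sketch of what redundancy of the genetic code means here, the following translates two different nucleotide sequences with the standard genetic code and shows that they encode the same peptide. The first sequence is the opening six codons of SEQ ID NO. 7 (Met Glu Ser Lys Val Leu); the second is a hypothetical synonymous variant constructed for this example.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(dna):
    """Translate a DNA string codon by codon, ignoring a trailing partial codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

listed = "ATGGAGAGCAAGGTGCTG"   # first six codons of SEQ ID NO. 7
variant = "ATGGAATCTAAAGTTCTC"  # hypothetical synonymous variant

print(translate(listed), translate(variant))  # prints MESKVL MESKVL
```

The two strings differ at five of six codons yet give an identical amino acid sequence, which is the sense in which such DNA sequences are equivalent.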

In particular, the invention contemplates
those DNA sequences which are sufficiently duplicative
of the sequence of SEQ ID NO. 7 so as to permit
hybridization therewith under standard high stringency
Southern hybridization conditions, such as those
described in Sambrook et al. (27), as well as the
biologically active proteins produced thereby.

This invention also comprises DNA sequences
which encode amino acid sequences which differ from
those of the novel receptor, but which are the
biological equivalent to those described for the
receptor. Such amino acid sequences may be said to be
biologically equivalent to those of the receptor if
their sequences differ only by minor deletions from or
conservative substitutions to the receptor sequence,
such that the tertiary configurations of the sequences
are essentially unchanged from those of the receptor.

For example, a codon for the amino acid
alanine, a hydrophobic amino acid, may be substituted
by a codon encoding another less hydrophobic residue,
such as glycine, or a more hydrophobic residue, such as
valine, leucine, or isoleucine. Similarly, changes
which result in substitution of one negatively charged
residue for another, such as aspartic acid for glutamic
acid, or one positively charged residue for another,
such as lysine for arginine, as well as changes based
on similarities of residues in their hydropathic index,
can also be expected to produce a biologically
equivalent product. Nucleotide changes which result in
alteration of the N-terminal or C-terminal portions of
the protein molecule would also not be expected to
alter the activity of the protein. It may also be
desirable to eliminate one or more of the cysteines
present in the sequence, as the presence of cysteines
may result in the undesirable formation of multimers
when the protein is produced recombinantly, thereby
complicating the purification and crystallization
processes. Each of the proposed modifications is well
within the routine skill in the art, as is
determination of retention of biological activity of the
encoded products. Therefore, where the terms "KDR
gene" or "KDR protein" are used in either the
specification or the claims, each will be understood to
encompass all such modifications and variations which
result in the production of a biologically equivalent
protein.
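
The substitution examples given above can be expressed as a simple classification sketch. The residue classes below are drawn only from the examples named in the text (alanine with glycine, valine, leucine and isoleucine; aspartic acid with glutamic acid; lysine with arginine). The class names and the function `is_conservative` are hypothetical, and the text's hydropathic-index criterion is not modelled here.

```python
# Illustrative residue classes taken from the examples in the paragraph above.
# This grouping is an assumption for the sketch, not a definition from the text.
RESIDUE_CLASSES = {
    "negative": set("DE"),            # aspartic acid, glutamic acid
    "positive": set("KR"),            # lysine, arginine
    "small_hydrophobic": set("AGVLI"),
}

def residue_class(aa):
    """Return the name of the class containing residue aa, or None."""
    for name, members in RESIDUE_CLASSES.items():
        if aa in members:
            return name
    return None

def is_conservative(original, replacement):
    """True if both residues fall in the same illustrative class."""
    cls = residue_class(original)
    return cls is not None and cls == residue_class(replacement)

print(is_conservative("D", "E"))  # prints True
print(is_conservative("D", "K"))  # prints False
```

Whether a given change actually preserves activity would still have to be determined experimentally, as the text notes.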



WO 92/14748 PCT/US92/01300

In addition to the full length gene and
protein, the invention encompasses biologically active
fragments of each. By "biologically active" is meant a
protein fragment which qualitatively retains the
receptor activity of the larger KDR protein, or, in the
case of a nucleotide sequence, which encodes such a
protein fragment. It also refers, for purposes of
antibody production, to fragments which are capable of
eliciting production of antibodies capable of binding
to the receptor protein.

To determine the size of the mRNA transcribed
from the kdp gene, Northern blot hybridization
experiments are carried out using an EcoRI/BamHI DNA
segment (nucleotides 1510-2417, SEQ ID NO. 7) as a
hybridization probe. The DNA used for the probe does
not contain any portion of the putative kinase domain,
and shares little sequence homology to other tyrosine
kinases. The Northern blot analysis (Figure 10) shows
that a 7 kb band is visualized in cytoplasmic poly(A)+
RNA of ABAE bovine aortic endothelial cells. This
transcript differs in size from previously reported
transcripts for known type III RTK (7,18).

The isolated cDNA is significant for several
reasons. The cDNA encodes a novel type III receptor
tyrosine kinase. The homology between the sequence of
this cDNA and that of other receptors, as well as
structural properties implied by the predicted amino
acid sequence, confirm the relationship. Receptors for
growth factors should have tremendous utility in drug
development as they face the outside of the cell and
thus are among the best targets for drugs. In
addition, the cellular levels of some receptors, in
particular the neu proto-oncogene, increase during some
cancers. This has been taken advantage of in designing
diagnostic tests for these cancers.
Southern analysis demonstrates that the kdp
gene is present in mouse as well as human DNA. Mouse
and human (HeLa cell) DNA, 15 ng of each, are digested
with 10 units of EcoRI and electrophoresed on a 0.7%
agarose gel. The DNA is transferred onto
nitrocellulose. The filter is hybridized to a
[32P]CTP-labelled cDNA probe made by nick translating
an EcoRI/BamHI fragment from the 5' end of the kdp cDNA
(nucleotides 1510-2417, SEQ ID NO. 7). Hybridization
is conducted at 30°C in 5 X SSPE, 50% formamide, 0.1%
SDS, plus 150 µg/ml salmon sperm DNA. The DNA probe
hybridizes to Southern blots containing EcoRI digested
DNA. After 48 hours, the filter is washed at room
temperature in 2 X SSC plus 0.1% SDS for 20 minutes,
followed by two 20 minute washes at 40°C with 0.1 X SSC
plus 0.1% SDS. Autoradiography is then performed for
48 hours. As shown in Figure 11, radioactively
labelled DNA is present in both human and mouse
samples. This indicates that the kdp gene is present
in both species.

An experiment is conducted to ascertain the
genetic locus of kdp on human chromosomes.
Thirty-eight cell hybrids from 18 unrelated human cell
lines and four mouse cell lines are examined (19). A
DNA probe hybridizes to Southern blots which contain
EcoRI digested DNA from the human-mouse hybrids (using
the procedure and DNA probe for human and mouse tissue
described in relation to Figure 11). Table I sets
forth the results of the segregation of kdp with human
chromosomes in EcoRI digested human-mouse somatic cell
hybrid DNA:



              Concordant #        Discordant #
              of Hybrids          of Hybrids
Chromosome    (+/+)    (-/-)      (+/-)    (-/+)    % Discordancy

    1            4       19          8        4          34
    2            8       18          5        6          30
    3           11       12          3        9          34
    4           14       24          0        0           0
    5            7       14          7       10          45
    6            7       19          7        5          32
    7           11       14          3        8          31
    8            8       11          6       13          50
    9            3       20         10        4          38
   10           12        9          2       14          43
   11            9       13          4       11          41
   12            9       10          5       14          50
   13            7       18          7        6          34
   14           11        8          3       16          50
   15            9       15          5        8          35
   16            7       19          7        5          32
   17           12        7          2       16          49
   18           11       14          3       10          34
   19            7       18          7        6          34
   20            9       10          5       14          50
   21           11        9          3       15          47
   22            3       16         10        7          47
    X            8       10          3        8          38

The scoring is determined by the presence (+)
or absence (-) of human bands in the hybrids on
Southern blots prepared in a manner similar to those
shown in Figure 11. The scoring is compared to the
presence or absence of human chromosomes in each
hybrid. A 0% discordancy indicates a matched
segregation of the DNA probe with a chromosome. Three
fragments, approximately 6.5 kb, 3.1 kb, and 0.7 kb in
size are detected in digests of human DNA (Figure 11),
and in all hybrids which had retained human
chromosome 4 (Table I). All other chromosomes are
excluded in at least 11 discordant hybrids (Table I).
The results of Figure 11 and Table I demonstrate that
the genetic locus of kdp is on human chromosome 4.
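
The percent discordancy column of Table I can be reproduced with a short calculation. The sketch below assumes that discordancy is the number of discordant hybrids divided by all hybrids scored for that chromosome, rounded to the nearest percent. That formula is not stated in the text, but it is consistent with the tabulated values for chromosomes 1, 4 and 8 used here. The function and variable names are hypothetical.

```python
def percent_discordancy(pos_pos, neg_neg, pos_neg, neg_pos):
    """Percent of scored hybrids in which probe and chromosome disagree.

    pos_pos, neg_neg: concordant hybrids, (+/+) and (-/-)
    pos_neg, neg_pos: discordant hybrids, (+/-) and (-/+)
    """
    total = pos_pos + neg_neg + pos_neg + neg_pos
    return 100.0 * (pos_neg + neg_pos) / total

# Counts copied from Table I for three chromosomes.
table_rows = {"1": (4, 19, 8, 4), "4": (14, 24, 0, 0), "8": (8, 11, 6, 13)}

for chromosome, counts in table_rows.items():
    print(chromosome, round(percent_discordancy(*counts)))
# prints:
# 1 34
# 4 0
# 8 50
```

Only chromosome 4 gives 0%, meaning the probe signal and the chromosome were present together or absent together in all 38 hybrids.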

It is noteworthy that both the c-kit (3) and
the type A PDGF (28) receptor genes map to human
chromosome 4. The finding that the genetic locus of
kdp is on human chromosome 4 provides further evidence
that the novel receptor of this invention is a type III
receptor tyrosine kinase.

The next step after identifying the entire
coding portion of the kdp gene is to express the
receptor protein encoded by that gene. The receptor
protein is then utilized so as to identify the growth
factor which binds specifically to the receptor.

The receptor protein is expressed using
established recombinant DNA methods. Suitable host
organisms include bacteria, viruses, yeast, insect or
mammalian cell lines, as well as other conventional
organisms. For example, CMT-3 monkey kidney cells are
transfected with a vector containing the complete coding
region of the KDR gene.

The complete coding portion of the KDR gene
is assembled by sequentially cloning into pUC119 three
DNA fragments derived from BTIII081.8, BTIII129.5, and
BTIV169. First, a SmaI-EcoRI fragment of clone
BTIII129.5 (nucleotides 3152-4236, SEQ ID NO. 7) is
blunt-ended with Klenow polymerase and introduced into
a SmaI site in pUC119. Next, a BamHI-SmaI fragment of
clone BTIII081.8 (nucleotides 2418-3151, SEQ ID NO. 7)
is introduced at a BamHI-SmaI site. Finally, a
SalI-BamHI fragment of clone BTIV169 (nucleotides
1-2417, SEQ ID NO. 7) is introduced at a SalI-BamHI
site. Part of the cloning site of pUC119 is contained
in the SalI-BamHI fragment, 5' to the KDR gene. In
order to clone the complete coding portion into an
expression vector, the assembled DNA (in pUC119) is
digested with SalI and Asp718 and recloned into the
eukaryotic expression vector pcDNA1tkpASP.
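
A quick consistency check on the assembly described above is to confirm that the three fragments, taken in nucleotide order, cover positions 1 to 4236 of SEQ ID NO. 7 with no gap or overlap. The sketch below uses the coordinates quoted in the text; the function name and data layout are hypothetical.

```python
# (clone, first nucleotide, last nucleotide) as quoted in the text,
# numbered on SEQ ID NO. 7.
fragments = [
    ("BTIII129.5", 3152, 4236),
    ("BTIII081.8", 2418, 3151),
    ("BTIV169", 1, 2417),
]

def tiles_exactly(frags, total_length):
    """True if the fragments cover 1..total_length with no gaps or overlaps."""
    expected_start = 1
    for _name, start, end in sorted(frags, key=lambda f: f[1]):
        if start != expected_start or end < start:
            return False
        expected_start = end + 1
    return expected_start == total_length + 1

print(tiles_exactly(fragments, 4236))  # prints True
```

The junctions fall at nucleotides 2417/2418 and 3151/3152, matching the BamHI and SmaI sites used to join the fragments.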

This vector is a modification of the vector
pcDNA1 (Invitrogen; San Diego, CA). Specifically, the
ampicillin resistance gene is cloned from pBR322 into
pcDNA1. A small SV40 T splice and the SV40
polyadenylation signal are then removed and are replaced
with a Herpes Simplex Virus-1 polyadenylation signal.
Finally, a cytomegalovirus intermediate early splice is
inserted 5' to the cloning site to yield pcDNA1tkpASP.

Transfection of CMT-3 cells is done using
DEAE-dextran. Forty-eight hours after transfection,
expression of the novel receptor is monitored using
Western blot analysis as follows.

An antibody is used to assay the expressed
receptor protein. The predicted amino acid sequence of
the receptor is used to generate peptide-derived
antibodies to the receptor by conventional techniques.
The presence of the novel receptor protein is confirmed
by Western blot hybridization.

Specifically, a synthetic peptide with 13
residues is synthesized based on the 12 residues
corresponding to amino acids 986-997 of the putative
amino acid sequence of the KDR protein (SEQ ID NO. 7),
with a cysteine residue linked to the lysine (amino
acid 997). The cysteine facilitates coupling of the
peptide to a macromolecule which functions as a carrier
for the peptide. For example, the peptide is coupled
to keyhole limpet haemocyanin (KLH) using
m-maleimidobenzoyl-N-hydroxysuccinimide ester. Other
conventional carriers may be used such as human and
bovine serum albumins, myoglobins, β-galactosidase,
penicillinase and bacterial toxoids, as well as
synthetic molecules such as
multi-poly-DL-alanyl-poly-L-lysine and poly-L-lysine.

Rabbits are immunized with the peptide-KLH
conjugate to raise polyclonal antibodies. After
different periods of time, serum is collected from the
rabbits. The IgG fraction of the serum is then
purified using a protein A Sepharose column (Pharmacia
LKB, Uppsala, Sweden) to obtain the antibody which is
designated anti-KDR.PS23.

A sample of the expressed protein is
subjected to SDS-PAGE using a 7% acrylamide gel under
standard conditions. The protein band is then
transferred on to nitrocellulose paper for Western blot
analysis and the anti-KDR.PS23 antibody is added at a
dilution of 1:1,000 to allow the antibody to react with
the protein present. A second antibody, goat
anti-rabbit antibody to rabbit IgG, which binds to
anti-KDR.PS23, is then added. The detection of
proteins which react with the antibodies is performed
by autoradiography of bands using an ECL system
(Amersham, Chicago, IL). The results are depicted in
Figure 12.

Figure 12 shows that a 190 kD protein is
present in the cells transfected with the vector
containing the KDR gene, but is absent in cells
transfected with vector alone. The size of this
protein is consistent with it being encoded by the KDR
gene, in that the predicted amino acid sequence for the
unglycosylated KDR protein is 156 kD, and that sequence
contains 18 putative extracellular glycosylation sites
which would account for the balance of the size seen in
the 190 kD band.

The expressed receptor is then used to
identify the growth factor which interacts with the
receptor. In order to test the hypothesis that the KDR
protein is a receptor for VEGF, radioligand binding
studies are performed. VEGF (provided by D.
Gospodarowicz) is radiolabelled with 125I. Cells are
transfected with either the vector pcDNA1tkpASP alone
(bars 1 and 2 of Figure 13) or with the vector
containing the KDR gene (bars 3 and 4). Forty-eight
hours later, the transfected cell samples are washed
with PBS and then incubated for 90 minutes with
serum-free media containing 50 pM [125I]VEGF (specific
activity equal to 4,000 cpm per fmol). Excess
nonradioactive VEGF, 5 nM, is added to some samples
(bars 2 and 4) to define specific binding sites. The
samples are washed with ice cold PBS, and the cells are
transferred to gamma-counting tubes using a detergent,
0.1% lubrol.

The results of the radioligand binding
studies are depicted in Figure 13. Figure 13 shows
that CMT-3 cells transfected with vector containing the
KDR gene contain specific binding sites for [125I]VEGF
(compare bars 3 and 4), while cells transfected with
vector alone do not (compare bars 1 and 2).
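
The comparison of bars in Figure 13 amounts to subtracting binding measured in the presence of excess unlabelled VEGF from total binding, then converting counts to ligand amount with the stated specific activity of 4,000 cpm per fmol. The sketch below illustrates that arithmetic. The cpm values are hypothetical placeholders, not data from Figure 13, and the function name is an assumption.

```python
CPM_PER_FMOL = 4000.0      # specific activity stated in the text
LABELLED_VEGF_PM = 50      # [125I]VEGF concentration, pM
UNLABELLED_VEGF_PM = 5000  # excess nonradioactive VEGF, 5 nM expressed in pM

def specific_binding_fmol(total_cpm, nonspecific_cpm):
    """Specific binding (fmol) = (total - nonspecific counts) / specific activity."""
    return max(total_cpm - nonspecific_cpm, 0.0) / CPM_PER_FMOL

# Hypothetical counts for illustration only.
kdr_cells = specific_binding_fmol(total_cpm=9000.0, nonspecific_cpm=1000.0)
vector_cells = specific_binding_fmol(total_cpm=1100.0, nonspecific_cpm=1000.0)

print(UNLABELLED_VEGF_PM // LABELLED_VEGF_PM)  # prints 100 (fold excess)
print(kdr_cells, vector_cells)                 # prints 2.0 0.025
```

A large difference between the two values, as in this made-up example, is the pattern the text describes for cells expressing the receptor versus cells carrying vector alone.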

Further evidence that the KDR gene encodes a
receptor for VEGF is demonstrated by affinity
cross-linking studies (Figure 14). Figure 14 depicts
the results of affinity cross-linking of [125I]VEGF to
CMT-3 cells which express the KDR protein. CMT-3 cells
are transfected with either the pcDNA1tkpASP vector
alone (lane 1 of Figure 14) or with the vector
containing the KDR gene (lane 2). Forty-eight hours
later, the cells are washed in PBS, and serum-free
media containing 200 pM [125I]VEGF is added. After 90
minutes at room temperature, an affinity cross-linker
disuccinimidyl suberate (Pierce Biochemicals, Rockford,
IL), 0.5 mM, is added for 15 minutes. The samples are
then subjected to SDS-PAGE autoradiography.

Three protein bands are seen in SDS-PAGE
autoradiograms from samples of CMT-3 cells transfected
with the KDR gene and cross-linked to [125I]VEGF (lane
1). The size of band 1 (235 kD) is consistent with it
being the 190 kD protein seen by Western blot analysis
(Figure 12), because a 45 kD [125I]VEGF dimer plus 190
kD would migrate in a manner identical to band 1. The
origin of band 2 is not clear, but may represent an
altered glycosylation form of band 1. Band 3 (22.5 kD)
is most likely VEGF itself, and can be seen faintly in
cells transfected with vector alone (lane 2).
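
The band assignments rest on simple mass arithmetic, sketched below. The sizes are those quoted in the text. The helper `is_consistent` and its 5% tolerance are assumptions added for illustration, since apparent sizes on SDS-PAGE are approximate.

```python
RECEPTOR_KD = 190.0    # protein seen by Western blot (Figure 12)
VEGF_DIMER_KD = 45.0   # [125I]VEGF dimer
BAND_1_KD = 235.0      # largest cross-linked band
BAND_3_KD = 22.5       # smallest band

def is_consistent(observed_kd, expected_kd, tolerance=0.05):
    """True if observed is within a fractional tolerance of expected (assumed 5%)."""
    return abs(observed_kd - expected_kd) <= tolerance * expected_kd

complex_kd = RECEPTOR_KD + VEGF_DIMER_KD  # receptor plus cross-linked dimer
monomer_kd = VEGF_DIMER_KD / 2            # one chain of the dimer

print(complex_kd, is_consistent(BAND_1_KD, complex_kd))  # prints 235.0 True
print(monomer_kd, is_consistent(BAND_3_KD, monomer_kd))  # prints 22.5 True
```

Band 3 matching half the dimer mass fits the text's reading that it is VEGF itself.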

The novel KDR gene of this invention is
significant for several reasons. Studies of the
cellular mechanisms by which receptors function in
signal transduction have led in the past to a better
understanding of how cells grow in both normal and
diseased states. Receptor tyrosine kinases, in
particular, have received a great deal of attention
because of the observation that a number of RTK are the
cellular counterparts for viral oncogenes, implying a
direct correlation between changes in the expression of
RTK and cancer. In view of this, it is likely that
pharmaceuticals targeted at RTK will inhibit the
changes in cell growth associated with cancer. In
addition, it is likely that monitoring the levels of
expression of RTK will prove valuable in diagnosing the
onset of cancer.

The described cDNA is isolated from a human
endothelial cell library. Endothelial cells
participate in angiogenesis, the formation of new blood
capillaries. Previous work directed towards
identifying the growth factors which regulate
angiogenesis has primarily focused upon FGF (13),
although recent evidence has indicated that other
growth factors may be involved as well (12,15,29).
This evidence consists of the observations that: 1)
FGF does not contain a signal sequence (24) and thus
may not be secreted from cells in a manner consistent
with the tight regulation of angiogenesis, and 2)
endothelial cells synthesize FGF and yet are normally
resting (15). Our discovery, then, of a novel growth
factor receptor may ultimately clarify these
inconsistencies and lead to a better understanding of
endothelial cell function.

The teachings of this invention can be
readily used by those skilled in the art for the purpose
of testing pharmaceuticals targeted at the KDR protein.
Two examples of approaches which can be used for this
purpose are now given.

First, the methods described in this
invention for studying the interaction of VEGF with KDR
protein can be used to test for pharmaceuticals which
will antagonize that interaction. For these studies,
cells expressing the KDR protein are incubated with
[125I]VEGF, together with a candidate pharmaceutical.
Inhibition of radioligand binding is tested for;
significant inhibition indicates the candidate is an
antagonist. Permanent expression of the KDR protein in
a cell type such as NIH3T3 cells would make these
studies less laborious. This can be easily achieved by
those skilled in the art using the described methods.
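
The antagonist test described above reduces to comparing specific [125I]VEGF binding with and without the candidate. A minimal sketch follows. The function names, the 50% cutoff standing in for "significant inhibition", and the cpm values are all hypothetical, since the text gives no numerical criterion.

```python
def percent_inhibition(control_specific_cpm, treated_specific_cpm):
    """Percent reduction in specific binding caused by the candidate."""
    if control_specific_cpm <= 0:
        raise ValueError("control specific binding must be positive")
    return 100.0 * (1.0 - treated_specific_cpm / control_specific_cpm)

def looks_like_antagonist(control_specific_cpm, treated_specific_cpm, cutoff=50.0):
    """Flag a candidate whose inhibition meets an assumed illustrative cutoff."""
    return percent_inhibition(control_specific_cpm, treated_specific_cpm) >= cutoff

# Hypothetical specific-binding counts (total minus nonspecific).
print(percent_inhibition(8000.0, 2000.0))     # prints 75.0
print(looks_like_antagonist(8000.0, 2000.0))  # prints True
print(looks_like_antagonist(8000.0, 7000.0))  # prints False
```

In practice the cutoff and the number of replicates would be chosen by the experimenter.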

Second, using the teachings of this
invention, those skilled in the art can study
structural properties of the KDR protein involved in
receptor function. This structural information can
then be used to more rationally design pharmaceuticals
which inhibit that function. Mutagenesis of the KDR
gene by well established protocols is one approach,
crystallization of the receptor binding site is
another.
SEQUENCE LISTING

(1) GENERAL INFORMATION:

(i) APPLICANT: Terman, Bruce I
               Carrion, Miguel E

(ii) TITLE OF INVENTION: Identification of a
     Novel Human Growth Factor Receptor

(iii) NUMBER OF SEQUENCES: 14

(iv) CORRESPONDENCE ADDRESS:

(A) ADDRESSEE: Alan M. Gordon
               American Cyanamid Company

(B) STREET: 1937 West Main Street, P.O. Box 60

(C) CITY: Stamford

(D) STATE: Connecticut

(E) COUNTRY: USA

(F) ZIP: 06904

(v) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: Floppy disk

(B) COMPUTER: IBM PC AT

(C) OPERATING SYSTEM: MS-DOS

(D) SOFTWARE: ASCII from IBM DW 4

(vi) CURRENT APPLICATION DATA:

(A) APPLICATION NUMBER:

(B) FILING DATE:

(C) CLASSIFICATION:

(vii) PRIOR APPLICATION DATA:

(A) APPLICATION NUMBER: 07/657,236

(B) FILING DATE: February 22, 1991

(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: Gordon, Alan M.

(B) REGISTRATION NUMBER: 30,637

(C) REFERENCE/DOCKET NUMBER: 31,298-01

(ix) TELECOMMUNICATION INFORMATION:

(A) TELEPHONE: 203 321 2719

(B) TELEFAX: 203 321 2971

(C) TELEX:

(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 27 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

GTCGAC AAY CTG TTG GGR GCC TGC AAC  27

(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 35 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

GAATTC AG CAC KTT NCT RGC YGC CAG GTC TGY GTC  35

(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 363 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

GAA TTC TGC AAA TTT GGA AAC CTG TCC ACT TAC CTG  36
AGG ACG AAG AGA AAT GAA TTT GTC CCC TAC AAG ACC  72
AAA GGG GCA CGA TTC CGT CAA GGG AAA GAC TAC GTT  108
GGA GCA ATC CCT GTG GAT CTG AAA CGG CGC TTG GAC  144
ACG CAT CAC CAG TAG CCA GAG CTC AGC CAG CTC TGG  180
ATT TGT GGA GGA GAA GTC CCT CAG TGA TGT AGA AGA  216
AGA GGA AGC TCC TGA AGA TCT GTA TAA GGA CTT CCT  252
GAC CTT GGA GCA TCT CAT CTG TTA CAG TTT CCA AGT  288
GGC TAA GGG CAT GGA GTT CTT GGC ATC GCG AAA GTG  324
TAT CCA CAG AGA CCT GGC AGC CAG GAA CGT GCT GAA  360
TTC  363

(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 251 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

GTC GAC AAT CTG TTG GGG GCC TGC ACC ATC CCA ACA  36
TCC TGC TGC TCT ACA ACT ATT TTT ATG ACC GGA GGA  72
GGA TCT ACT TGA TTC TAG AGT ATG CCC CCC GCG GAG  108
CTC TAC AAG GAG CTG CAG AAG AGC TGC ACA TTT GAC  144
GAG CAG CGA ACA GCC ACG ATC ATG GAG GAG TTG GCA  180
GAT GCT CTA ATG TAC TGC CGT GGG AAG AAG GTG ATT  216
CAC AGA GAC CTG GCA GCC AGC AAC GTG CTG AAT TC  251

(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 510 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:

(A) NAME/KEY: PDGF Receptor DNA

(B) LOCATION: internal sequence

(x) PUBLICATION INFORMATION:

(A) AUTHORS: Gronwald, R., et al.

(B) JOURNAL: Proc. Natl. Acad. Sci.

(C) VOLUME: 85

(D) PAGES: 3435-3439

(E) DATE: 1988

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

AAC CTG TGG GGG CCT GCA CCA AAG GAG GAC CAT CTA  36
TAT CAT CTA TAT CAT CAC TGA GTA CTG CCG CTA CGG  72
AGA CCT GGT GGA CTA CCT GCA CCG CAA CAA ACA CAC  108
CTT CCT GCA GCA CCA CTC CGA CAA GCG CCG CCC GCC  144
CAG CGC GGA GCT CTA CAG CAA TGC TCT GCC CGT TGG  180
GCT CCC CCT GCC CAG CCA TGT GTC CTT GAC CGG GGG  216
AGA GCG ACG GTG GCT ACA TGG ACA TGA GCA AGG ACG  252
AGT CGG TGG ACT ATG TGC CCA TGC TGG ACA TGA AAG  288
GAG ACG TGA AAT AGC AGA CAT CGA GTC CTC CAA CTA  324
CAT GGC CCC TTA CGA TAA CTA CGT TCC CTC TGC CCC  360
TGA GAG GAC CTG CCG AGC AAC TTT GAT CAA CGA GTC  396
TCC AGT GCT AAG CTA CAT GGA CCT CGT GGG CTT CAG  432
CTA CCA GGT GGC CAA TGG CAT GGA GTT CTG GCC TCC  468
AAG AAC TGC GTC CAC AGA GAC CTG GCG GCT AGG AAC  504
GTC CTT  510

(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 255 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:

(A) NAME/KEY: FGF Receptor DNA

(B) LOCATION: internal sequence

(x) PUBLICATION INFORMATION:

(A) AUTHORS: Ruta, M., et al.

(B) JOURNAL: Oncogene

(C) VOLUME: 3

(D) PAGES: 9-15

(E) DATE: 1988

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

AAC CTG CTG GGG GCC TGC ACG CAG GAT GGT CCC TTG  36
TAT GTC ATC GTG GAG TAT GCC TCC AAG GGC AAC CTG  72
CGG GAG TAC CTG CAG ACC CGG AGG CCC CCA GGG CTG  108
GAA TAC TGC TAT AAC CCC AGC CAC AAC CCA GAG GAG  144
CAG CTC TCC TCC AAG GAC CTG GTG TCC TGC GCC TAG  180
CAG GAG GCC CGA GGC ATG GAG TAT CTG GCC TCC AAG  216
AAG TGC ATA CAC CGA GAC CTG GCA GCC AGG AAT GTC  252
CTG  255

(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4236 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

ATG GAG AGC AAG GTG CTG CTG GCC GTC GCC CTG  33
Met Glu Ser Lys Val Leu Leu Ala Val Ala Leu
1               5                   10

TGG CTC TGC GTG GAG ACC CGG GCC GCC TCT GTG GGT  69
Trp Leu Cys Val Glu Thr Arg Ala Ala Ser Val Gly
            15                  20

TTG CCT AGT GTT TCT CTT GAT CTG CCC AGG CTC AGC  105
Leu Pro Ser Val Ser Leu Asp Leu Pro Arg Leu Ser
    25                  30                  35

ATA CAA AAA GAC ATA CTT ACA ATT AAG GCT AAT ACA  141
Ile Gln Lys Asp Ile Leu Thr Ile Lys Ala Asn Thr
                40                  45

ACT CTT CAA ATT ACT TGC AGG GGA CAG AGG GAC TTG  177
Thr Leu Gln Ile Thr Cys Arg Gly Gln Arg Asp Leu
        50                  55

GAC TGG CTT TGG CCC AAT AAT CAG AGT GGC AGT GAG  213
Asp Trp Leu Trp Pro Asn Asn Gln Ser Gly Ser Glu
60                  65                  70

CAA AGG GTG GAG GTG ACT GAG TGC AGC GAT GGC CTC  249
Gln Arg Val Glu Val Thr Glu Cys Ser Asp Gly Leu
            75                  80

TTC TGT AAG ACA CTC ACA ATT CCA AAA GTG ATC GGA  285
Phe Cys Lys Thr Leu Thr Ile Pro Lys Val Ile Gly
    85                  90                  95

AAT GAC ACT GGA GCC TAC AAG TGC TTC TAC CGG GAA  321
Asn Asp Thr Gly Ala Tyr Lys Cys Phe Tyr Arg Glu
                100                 105

ACT GAC TTG GCC TCG GTC ATT TAT GTC TAT GTT CAA  357
Thr Asp Leu Ala Ser Val Ile Tyr Val Tyr Val Gln
        110                 115

GAT TAC AGA TCT CCA TTT ATT GCT TCT GTT AGT GAC  393
Asp Tyr Arg Ser Pro Phe Ile Ala Ser Val Ser Asp
120                 125                 130

CAA CAT GGA GTC GTG TAC ATT ACT GAG AAC AAA AAC  429
Gln His Gly Val Val Tyr Ile Thr Glu Asn Lys Asn
            135                 140

AAA ACT GTG GTG ATT CCA TGT CTC GGG TCC ATT TCA  465
Lys Thr Val Val Ile Pro Cys Leu Gly Ser Ile Ser
    145                 150                 155

AAT CTC AAC GTG TCA CTT TGT GCA AGA TAC CCA GAA  501
Asn Leu Asn Val Ser Leu Cys Ala Arg Tyr Pro Glu
                160                 165

AAG AGA TTT GTT CCT GAT GGT AAC AGA ATT TCC TGG  537
Lys Arg Phe Val Pro Asp Gly Asn Arg Ile Ser Trp
        170                 175

GAC AGC AAG AAG GGC TTT ACT ATT CCC AGC TAC ATG  573
Asp Ser Lys Lys Gly Phe Thr Ile Pro Ser Tyr Met
180                 185                 190

ATC AGC TAT GCT GGC ATG GTC TTC TGT GAA GCA AAA  609
Ile Ser Tyr Ala Gly Met Val Phe Cys Glu Ala Lys
            195                 200

ATT AAT GAT GAA AGT TAC CAG TCT ATT ATG TAC ATA  645
Ile Asn Asp Glu Ser Tyr Gln Ser Ile Met Tyr Ile
    205                 210                 215

GTT GTC GTT GTA GGG TAT AGG ATT TAT GAT GTG GTT  681
Val Val Val Val Gly Tyr Arg Ile Tyr Asp Val Val
                220                 225

CTG AGT CCG TCT CAT GGA ATT GAA CTA TCT GTT GGA  717
Leu Ser Pro Ser His Gly Ile Glu Leu Ser Val Gly
        230                 235

GAA AAG CTT GTC TTA AAT TGT ACA GCA AGA ACT GAA  753
Glu Lys Leu Val Leu Asn Cys Thr Ala Arg Thr Glu
240                 245                 250

CTA AAT GTG GGG ATT GAC TTC AAC TGG GAA TAC CCT  789
Leu Asn Val Gly Ile Asp Phe Asn Trp Glu Tyr Pro
            255                 260

TCT TCG AAG CAT CAG CAT AAG AAA CTT GTA AAC CGA  825
Ser Ser Lys His Gln His Lys Lys Leu Val Asn Arg
    265                 270                 275

GAC CTA AAA ACC CAG TCT GGG AGT GAG ATG AAG AAA  861
Asp Leu Lys Thr Gln Ser Gly Ser Glu Met Lys Lys
                280                 285

TTT TTG AGC ACC TTA ACT ATA GAT GGT GTA ACC CGG  897
Phe Leu Ser Thr Leu Thr Ile Asp Gly Val Thr Arg
        290                 295

AGT GAC CAA GGA TTG TAC ACC TGT GCA GCA TCC AGT  933
Ser Asp Gln Gly Leu Tyr Thr Cys Ala Ala Ser Ser
300                 305                 310

GGG CTG ATG ACC AAG AAG AAC AGC ACA TTT GTC AGG  969
Gly Leu Met Thr Lys Lys Asn Ser Thr Phe Val Arg
            315                 320

GTC CAT GAA AAA CCT TTT GTT GCT TTT GGA AGT GGC  1005
Val His Glu Lys Pro Phe Val Ala Phe Gly Ser Gly
    325                 330                 335

ATG GAA TCT CTG GTG GAA GCC ACG GTG GGG GAG CGT  1041
Met Glu Ser Leu Val Glu Ala Thr Val Gly Glu Arg
                340                 345

GTC AGA ATC CCT GCG AAG TAC CTT GGT TAC CCA CCC  1077
Val Arg Ile Pro Ala Lys Tyr Leu Gly Tyr Pro Pro
        350                 355

CCA GAA ATA AAA TGG TAT AAA AAT GGA ATA CCC CTT  1113
Pro Glu Ile Lys Trp Tyr Lys Asn Gly Ile Pro Leu
360                 365                 370

GAG TCC AAT CAC ACA ATT AAA GCG GGG CAT GTA CTG  1149
Glu Ser Asn His Thr Ile Lys Ala Gly His Val Leu
            375                 380

ACG ATT ATG GAA GTG AGT GAA AGA GAC ACA GGA AAT  1185
Thr Ile Met Glu Val Ser Glu Arg Asp Thr Gly Asn
    385                 390                 395

TAC ACT GTC ATC CTT ACC AAT CCC ATT TCA AAG GAG  1221
Tyr Thr Val Ile Leu Thr Asn Pro Ile Ser Lys Glu
                400                 405

AAG CAG AGC CAT GTG GTC TCT CTG GTT GTG TAT GTC  1257
Lys Gln Ser His Val Val Ser Leu Val Val Tyr Val
        410                 415

CCA CCC CAG ATT GGT GAG AAA TCT CTA ATC TCT CCT  1293
Pro Pro Gln Ile Gly Glu Lys Ser Leu Ile Ser Pro
420                 425                 430

GTG GAT TCC TAC CAG TAC GGC ACC ACT CAA ACG CTG  1329
Val Asp Ser Tyr Gln Tyr Gly Thr Thr Gln Thr Leu
            435                 440

ACA TGT ACG GTC TAT GCC ATT CCT CCC CCG CAT CAC  1365
Thr Cys Thr Val Tyr Ala Ile Pro Pro Pro His His
    445                 450                 455

ATC CAC TGG TAT TGG CAG TTG GAG GAA GAG TGC GCC  1401
Ile His Trp Tyr Trp Gln Leu Glu Glu Glu Cys Ala
                460                 465

AAC GAG CCC AGC CAA GCT GTC TCA GTG ACA AAC CCA  1437
Asn Glu Pro Ser Gln Ala Val Ser Val Thr Asn Pro
        470                 475

TAC CCT TGT GAA GAA TGG AGA AGT GTG GAG GAC TTC  1473
Tyr Pro Cys Glu Glu Trp Arg Ser Val Glu Asp Phe
480                 485                 490

CAG GGA GGA AAT AAA ATT GAA GTT AAT AAA AAT CAA  1509
Gln Gly Gly Asn Lys Ile Glu Val Asn Lys Asn Gln
            495                 500

TTT GCT CTA ATT GAA GGA AAA AAC AAA ACT GTA AGT  1545
Phe Ala Leu Ile Glu Gly Lys Asn Lys Thr Val Ser
    505                 510                 515

ACC CTT GTT ATC CAA GCG GCA AAT GTG TCA GCT TTG  1581
Thr Leu Val Ile Gln Ala Ala Asn Val Ser Ala Leu
                520                 525

TAC AAA TGT GAA GCG GTC AAC AAA GTC GGG AGA GGA  1617
Tyr Lys Cys Glu Ala Val Asn Lys Val Gly Arg Gly
        530                 535

GAG AGG GTG ATC TCC TTC CAC GTG ACC AGG GGT CCT  1653
Glu Arg Val Ile Ser Phe His Val Thr Arg Gly Pro
540                 545                 550

GAA ATT ACT TTG CAA CCT GAC ATG CAG CCC ACT GAG  1689
Glu Ile Thr Leu Gln Pro Asp Met Gln Pro Thr Glu
            555                 560

CAG GAG AGC GTG TCT TTG TGG TGC ACT GCA GAC AGA  1725
Gln Glu Ser Val Ser Leu Trp Cys Thr Ala Asp Arg
    565                 570                 575






TCT ACG TTT GAG AAC CTC ACA TGG TAC AAG CTT GGC  1761
Ser Thr Phe Glu Asn Leu Thr Trp Tyr Lys Leu Gly
                580                 585

CCA CAG CCT CTG CCA ATC CAT GTG GGA GAG TTG CCC  1797
Pro Gln Pro Leu Pro Ile His Val Gly Glu Leu Pro
        590                 595

ACA CCT GTT TGC AAG AAC TTG GAT ACT CTT TGG AAA  1833
Thr Pro Val Cys Lys Asn Leu Asp Thr Leu Trp Lys
600                 605                 610

TTG AAT GCC ACC ATG TTC TCT AAT AGC ACA AAT GAC  1869
Leu Asn Ala Thr Met Phe Ser Asn Ser Thr Asn Asp
            615                 620

ATT TTG ATC ATG GAG CTT AAG AAT GCA TCC TTG CAG  1905
Ile Leu Ile Met Glu Leu Lys Asn Ala Ser Leu Gln
    625                 630                 635

GAC CAA GGA GAC TAT GTC TGC CTT GCT CAA GAC AGG  1941
Asp Gln Gly Asp Tyr Val Cys Leu Ala Gln Asp Arg
                640                 645

AAG ACC AAG AAA AGA CAT TGC GTG GTC AGG CAG CTC  1977
Lys Thr Lys Lys Arg His Cys Val Val Arg Gln Leu
        650                 655

ACA GTC CTA GAG CGT GTG GCA CCC ACG ATC ACA GGA  2013
Thr Val Leu Glu Arg Val Ala Pro Thr Ile Thr Gly
660                 665                 670

AAC CTG GAG AAT CAG ACG ACA AGT ATT GGG GAA AGC  2049
Asn Leu Glu Asn Gln Thr Thr Ser Ile Gly Glu Ser
            675                 680

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ATC GAA GTC TCA TGC ACG GCA TCT GGG AAT CCC CCT 2085 
lie Glu Val Ser Cys Thr Ala Ser Gly Asn Pro Pro 
685 690 695 

CCA CAG ATC ATG TGG TTT AAA GAT AAT GAG ACC CTT 2121
Pro Gln Ile Met Trp Phe Lys Asp Asn Glu Thr Leu
700 705

GTA GAA GAC TCA GGC ATT GTA TTG AAG GAT GGG AAC 2157
Val Glu Asp Ser Gly Ile Val Leu Lys Asp Gly Asn
710 715

CGG AAC CTC ACT ATC CGC AGA GTG AGG AAG GAG GAC 2193
Arg Asn Leu Thr Ile Arg Arg Val Arg Lys Glu Asp
720 725 730

GAA GGC CTC TAC ACC TGC CAG GCA TGC AGT GTT CTT 2229
Glu Gly Leu Tyr Thr Cys Gln Ala Cys Ser Val Leu
735 740

GGC TGT GCA AAA GTG GAG GCA TTT TTC ATA ATA GAA 2265
Gly Cys Ala Lys Val Glu Ala Phe Phe Ile Ile Glu
745 750 755

GGT GCC CAG GAA AAG ACG AAC TTG GAA ATC ATT ATT 2301
Gly Ala Gln Glu Lys Thr Asn Leu Glu Ile Ile Ile
760 765

CTA GTA GGC ACG ACG GTG ATT GCC ATG TTC TTC TGG 2337
Leu Val Gly Thr Thr Val Ile Ala Met Phe Phe Trp
770 775

CTA CTT CTT GTC ATC ATC CTA GGG ACC GTT AAG CGG 2373
Leu Leu Leu Val Ile Ile Leu Gly Thr Val Lys Arg
780 785 790

GCC AAT GGA GGG GAA CTG AAG ACA GGC TAC TTG TCC 2409
Ala Asn Gly Gly Glu Leu Lys Thr Gly Tyr Leu Ser
795 800

ATC GTC ATG GAT CCA GAT GAA CTC CCA TTG GAT GAA 2445
Ile Val Met Asp Pro Asp Glu Leu Pro Leu Asp Glu
805 810 815

CAT TGT GAA CGA CTG CCT TAT GAT GCC AGC AAA TGG 2481
His Cys Glu Arg Leu Pro Tyr Asp Ala Ser Lys Trp
820 825

GAA TTC CCC AGA GAC CGG CTG AAC CTA GGT AAG CCT 2517
Glu Phe Pro Arg Asp Arg Leu Asn Leu Gly Lys Pro
830 835

CTT GGC CGT GGT GCC TTT GGC CAA GAG ATT GAA GCA 2553
Leu Gly Arg Gly Ala Phe Gly Gln Glu Ile Glu Ala
840 845 850

GAT GCC TTT GGA ATT GAC AAG ACA GCA ACT TGC AGG 2589
Asp Ala Phe Gly Ile Asp Lys Thr Ala Thr Cys Arg
855 860

ACA GTA GCA GTC AAA ATG TTG AAA GAA GGA GCA ACA 2625
Thr Val Ala Val Lys Met Leu Lys Glu Gly Ala Thr
865 870 875

CAC AGT GAG CAT CGA GCT CTC ATG TCT GAA CTC AAG 2661
His Ser Glu His Arg Ala Leu Met Ser Glu Leu Lys
880 885

ATC CTC ATT CAT ATT GGT CAC CAT CTC AAT GTG GTC 2697
Ile Leu Ile His Ile Gly His His Leu Asn Val Val
890 895

AAC CTT CTA GGT GCC TGT ACC AAG CCA GGA GGG CCA 2733
Asn Leu Leu Gly Ala Cys Thr Lys Pro Gly Gly Pro
900 905 910

CTC ATG GTG ATT GTG GAA TTC TGC AAA TTT GGA AAC 2769
Leu Met Val Ile Val Glu Phe Cys Lys Phe Gly Asn
915 920

CTG TCC ACT TAC CTG AGG AGC AAG AGA AAT GAA TTT 2805
Leu Ser Thr Tyr Leu Arg Ser Lys Arg Asn Glu Phe
925 930 935

GTC CCC TAC AAG ACC AAA GGG GCA CGA TTC CGT CAA 2841
Val Pro Tyr Lys Thr Lys Gly Ala Arg Phe Arg Gln
940 945

GGG AAA GAC TAC GTT GGA GCA ATC CCT GTG GAT CTG 2877
Gly Lys Asp Tyr Val Gly Ala Ile Pro Val Asp Leu
950 955

AAA CGG CGC TTG GAC AGC ATC ACC AGT AGC CAG AGC 2913
Lys Arg Arg Leu Asp Ser Ile Thr Ser Ser Gln Ser
960 965 970

TCA GCC AGC TCT GGA TTT GTG GAG GAG AAG TCC CTC 2949
Ser Ala Ser Ser Gly Phe Val Glu Glu Lys Ser Leu
975 980

AGT GAT GTA GAA GAA GAG GAA GCT CCT GAA GAT CTG 2985
Ser Asp Val Glu Glu Glu Glu Ala Pro Glu Asp Leu
985 990 995

TAT AAG GAC TTC CTG ACC TTG GAG CAT CTC ATC TGT 3021
Tyr Lys Asp Phe Leu Thr Leu Glu His Leu Ile Cys
1000 1005

TAC AGC TTC CAA GTG GCT AAG GGC ATG GAG TTC TTG 3057
Tyr Ser Phe Gln Val Ala Lys Gly Met Glu Phe Leu
1010 1015

GCA TCG CGA AAG TGT ATC CAC AGG GAC CTG GCG GCA 3093
Ala Ser Arg Lys Cys Ile His Arg Asp Leu Ala Ala
1020 1025 1030

CGA AAT ATC CTC TTA TCG GAG AAG AAC GTG GTT AAA 3129
Arg Asn Ile Leu Leu Ser Glu Lys Asn Val Val Lys
1035 1040

ATC TGT GAC TTT GGC TTG GCC CGG GAT ATT TAT AAA 3165
Ile Cys Asp Phe Gly Leu Ala Arg Asp Ile Tyr Lys
1045 1050 1055

GAT CCA GAT TAT GTC AGA AAA GGA GAT GCT CGC CTC 3201
Asp Pro Asp Tyr Val Arg Lys Gly Asp Ala Arg Leu
1060 1065

CCT TTG AAA TGG ATG GCC CCA GAA ACA ATT TTT GAC 3237
Pro Leu Lys Trp Met Ala Pro Glu Thr Ile Phe Asp
1070 1075

AGA GTG TAC ACA ATC CAG AGT GAC GTC TGG TCT TTT 3273
Arg Val Tyr Thr Ile Gln Ser Asp Val Trp Ser Phe
1080 1085 1090

GGT GTT TTG CTG TGG GAA ATA TTT TCC TTA GGT GCT 3309
Gly Val Leu Leu Trp Glu Ile Phe Ser Leu Gly Ala
1095 1100

TCT CCA TAT CCT GGG GTA AAG ATT GAT GAA GAA TTT 3345
Ser Pro Tyr Pro Gly Val Lys Ile Asp Glu Glu Phe
1105 1110 1115

TGT AGG CGA TTG AAA GAA GGA ACT AGA ATG AGG GCC 3381
Cys Arg Arg Leu Lys Glu Gly Thr Arg Met Arg Ala
1120 1125

CCT GAT TAT ACT ACA CCA GAA ATG TAC CAG ACC ATG 3417
Pro Asp Tyr Thr Thr Pro Glu Met Tyr Gln Thr Met
1130 1135

CTG GAC TGC TGG CAC GGG GAG CCC AGT CAG AGA CCC 3453
Leu Asp Cys Trp His Gly Glu Pro Ser Gln Arg Pro
1140 1145 1150

ACG TTT TCA GAG TTG GTG GAA CAT TTG GGA AAT CTC 3489
Thr Phe Ser Glu Leu Val Glu His Leu Gly Asn Leu
1155 1160

TTG CAA GCT AAT GCT CAG CAG GAT GGC AAA GAC TAC 3525
Leu Gln Ala Asn Ala Gln Gln Asp Gly Lys Asp Tyr
1165 1170 1175

ATT GTT CTT CCG ATA TCA GAG ACT TTG AGC ATG GAA 3561
Ile Val Leu Pro Ile Ser Glu Thr Leu Ser Met Glu
1180 1185

GAG GAT TCT GGA CTC TCT CTG CCT ACC TCA CCT GTT 3597
Glu Asp Ser Gly Leu Ser Leu Pro Thr Ser Pro Val
1190 1195

TCC TGT ATG GAG GAG GAG GAA GTA TGT GAC CCC AAA 3633
Ser Cys Met Glu Glu Glu Glu Val Cys Asp Pro Lys
1200 1205 1210

TTC CAT TAT GAC AAC ACA GCA GGA ATC AGT CAG TAT 3669
Phe His Tyr Asp Asn Thr Ala Gly Ile Ser Gln Tyr
1215 1220

CTG CAG AAC AGT AAG CGA AAG AGC CGG CCT GTG AGT 3705
Leu Gln Asn Ser Lys Arg Lys Ser Arg Pro Val Ser
1225 1230 1235

GTA AAA ACA TTT GAA GAT ATC CCG TTA GAA GAA CCA 3741
Val Lys Thr Phe Glu Asp Ile Pro Leu Glu Glu Pro
1240 1245

GAA GTA AAA GTA ATC CCA GAT GAC AAC CAG ACG GAC 3777
Glu Val Lys Val Ile Pro Asp Asp Asn Gln Thr Asp
1250 1255

AGT GGT ATG GTT CTT GCC TCA GAA GAG CTG AAA ACT 3813
Ser Gly Met Val Leu Ala Ser Glu Glu Leu Lys Thr
1260 1265 1270

TTG GAA GAC AGA ACC AAA TTA TCT CCA TCT TTT GGT 3849
Leu Glu Asp Arg Thr Lys Leu Ser Pro Ser Phe Gly
1275 1280

GGA ATG GTG CCC AGC AAA AGC AGG GAG TCT GTG GCA 3885
Gly Met Val Pro Ser Lys Ser Arg Glu Ser Val Ala
1285 1290 1295

TCT GAA GGC TCA AAC CAG ACA AGC GGC TAC CAG TCC 3921
Ser Glu Gly Ser Asn Gln Thr Ser Gly Tyr Gln Ser
1300 1305

GGA TAT CAC TCC GAT GAC ACA GAC ACC ACC GTG TAC 3957
Gly Tyr His Ser Asp Asp Thr Asp Thr Thr Val Tyr
1310 1315

TCC AGT GAG GAA GCA GAA CTT TTA AAG CTG ATA GAG 3993
Ser Ser Glu Glu Ala Glu Leu Leu Lys Leu Ile Glu
1320 1325 1330

ATT GGA GTG CAA ACC GGT AGC ACA GCC CAG ATT CTC 4029
Ile Gly Val Gln Thr Gly Ser Thr Ala Gln Ile Leu
1335 1340

CAG CCT GAC ACG GGG ACC ACA CTG AGC TCT CCT CCT 4065
Gln Pro Asp Thr Gly Thr Thr Leu Ser Ser Pro Pro
1345 1350 1355

GTT TAAAAGGAAG CATCCACACC CCAACTCCCG GACATCACAT 4108
Val
1356

GAGAGGTCTG CTCAGATTTT GAAGTGTTGT TCTTTCCACC 4148
AGCAGGAAGT AGCCGCATTT GATTTTCATT TCGACAACAG 4188
AAAAAGGACC TCGGACTGCA GGGAGCCAGC TCTTCTAGGC 4228
TTGTGACC 4236



(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 433 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(ix) FEATURE:
(A) NAME/KEY: c-kit proto-oncogene receptor
(B) LOCATION: Amino acids 543-975

(x) PUBLICATION INFORMATION:
(A) AUTHORS: Yarden, Y., et al.
(B) JOURNAL: EMBO J.
(C) VOLUME: 6
(D) PAGES: 3341-3351
(E) DATE: 1987

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Leu Thr Tyr Lys Tyr Leu Gln Lys Pro Met Tyr Glu Val Gln
543 545 550 555

Trp Lys Val Val Glu Glu Ile Asn Gly Asn Asn Tyr Val Tyr
560 565 570

Ile Asp Pro Thr Gln Leu Pro Tyr Asp His Lys Trp Glu Phe
575 580

Pro Arg Asn Arg Leu Ser Phe Gly Lys Thr Leu Gly Ala Gly
585 590 595

Ala Phe Gly Lys Val Val Ala Glu Thr Ala Tyr Gly Leu Ile
600 605 610

Lys Ser Asp Ala Ala Met Thr Val Ala Val Lys Met Leu Lys
615 620 625

Pro Ser Ala His Leu Thr Glu Arg Glu Ala Leu Met Ser Glu
630 635 640

Leu Lys Val Leu Ser Tyr Leu Gly Asn His Met Asn Ile Val
645 650

Asn Leu Leu Gly Ala Cys Thr Ile Gly Gly Pro Thr Leu Val
655 660 665

Ile Thr Glu Tyr Cys Cys Tyr Gly Asp Leu Leu Asn Phe Leu
670 675 680

Arg Arg Lys Arg Asp Ser Phe Ile Cys Ser Lys Gln Glu Asp
685 690 695

His Ala Glu Ala Ala Leu Tyr Lys Asn Leu Leu His Ser Lys
700 705 710

Glu Ser Ser Cys Ser Asp Ser Thr Asn Glu Tyr Met Asp Met
715 720

Lys Pro Gly Val Ser Tyr Val Val Pro Thr Lys Ala Asp Lys
725 730 735

Arg Arg Ser Val Arg Ile Gly Ser Tyr Ile Glu Arg Asp Val
740 745 750

Thr Pro Ala Ile Met Glu Asp Asp Glu Leu Ala Leu Asp Leu
755 760 765

Glu Asp Leu Leu Ser Phe Ser Tyr Gln Val Lys Gly Met Ala
770 775 780

Phe Leu Ala Ser Lys Asn Cys Ile His Arg Asp Leu Ala Ala
785 790

Arg Asn Ile Leu Leu Thr His Gly Arg Ile Thr Lys Ile Cys
795 800 805

Asp Phe Gly Leu Ala Arg Asp Ile Lys Asn Asp Ser Asn Tyr
810 815 820

Val Val Lys Gly Asn Ala Arg Leu Pro Val Lys Val Met Ala
825 830 835

Pro Glu Ser Ile Phe Asn Cys Val Tyr Thr Glu Glu Ser Asp
840 845 850

Val Trp Ser Tyr Gly Ile Phe Leu Trp Glu Leu Phe Ser Leu
855 860

Gly Ser Ser Pro Tyr Pro Gly Met Pro Val Lys Ser Lys Phe
865 870 875

Tyr Lys Met Ile Lys Glu Gly Phe Arg Met Leu Ser Pro Glu
880 885 890

His Ala Pro Ala Glu Met Tyr Asp Ile Met Lys Thr Cys Trp
895 900 905

Asp Ala Asp Pro Leu Lys Arg Pro Thr Phe Lys Gln Ile Val
910 915 920

Gln Leu Ile Glu Lys Gln Ile Ser Glu Ser Thr Asn His Ile
925 930

Tyr Ser Asn Leu Ala Asn Cys Ser Pro Asn Arg Gln Lys Pro
935 940 945

Val Val Asp His Ser Val Arg Ile Asn Ser Val Gly Ser Thr
950 955 960

Ala Ser Ser Ser Gln Pro Leu Leu Val His Asp Asp Val
965 970 975

(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 437 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(ix) FEATURE:
(A) NAME/KEY: CSF-1 receptor
(B) LOCATION: Amino acids 536-972

(x) PUBLICATION INFORMATION:
(A) AUTHORS: Coussens, L., et al.
(B) JOURNAL: Nature
(C) VOLUME: 320
(D) PAGES: 277-280
(E) DATE: 1986

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

Leu Leu Tyr Lys Tyr Lys Gln Lys Pro Lys Tyr Gln Val Arg
536 540 545

Trp Lys Ile Ile Glu Ser Tyr Glu Gly Asn Ser Tyr Thr Phe
550 555 560

Ile Asp Pro Thr Gln Leu Pro Tyr Asn Glu Lys Trp Glu Phe
565 570 575

Pro Arg Asn Asn Leu Gln Phe Gly Lys Thr Leu Gly Ala Gly
580 585 590

Ala Phe Gly Lys Val Val Glu Ala Thr Ala Phe Gly Leu Gly
595 600 605

Lys Glu Asp Ala Val Leu Lys Val Ala Val Lys Met Leu Lys
610 615

Ser Thr Ala His Ala Asp Glu Lys Glu Ala Leu Met Ser Glu
620 625 630

Leu Lys Ile Met Ser His Leu Gly Gln His Glu Asn Ile Val
635 640 645

Asn Leu Leu Gly Ala Cys Thr His Gly Gly Pro Val Leu Val
650 655 660

Ile Thr Glu Tyr Cys Cys Tyr Gly Asp Leu Leu Asn Phe Leu
665 670 675

Arg Arg Lys Ala Glu Ala Met Leu Gly Pro Ser Leu Ser Pro
680 685

Gly Gln Asp Pro Glu Gly Gly Val Asp Tyr Lys Asn Ile His
690 695 700

Leu Glu Lys Lys Tyr Val Arg Arg Asp Ser Gly Phe Ser Ser
705 710 715

Gln Gly Val Asp Thr Tyr Val Glu Met Arg Pro Val Ser Thr
720 725 730

Ser Ser Asn Asp Ser Phe Ser Glu Gln Asp Leu Asp Lys Glu
735 740 745

Asp Gly Arg Pro Leu Glu Leu Arg Asp Leu Leu His Phe Ser
750 755

Ser Gln Val Ala Gln Gly Met Ala Phe Leu Ala Ser Lys Asn
760 765 770

Cys Ile His Arg Asp Val Ala Ala Arg Asn Val Leu Leu Thr
775 780 785

Asn Gly His Val Ala Lys Ile Gly Asp Phe Gly Leu Ala Arg
790 795 800

Asp Ile Met Asn Asp Ser Asn Tyr Ile Val Lys Gly Asn Ala
805 810 815

Arg Leu Pro Val Lys Trp Met Ala Pro Glu Ser Ile Phe Asp
820 825

Cys Val Tyr Thr Val Gln Ser Asp Val Trp Ser Tyr Gly Ile
830 835 840

Leu Leu Trp Glu Ile Phe Ser Leu Gly Leu Asn Pro Tyr Pro
845 850 855

Gly Ile Leu Val Asn Ser Lys Phe Tyr Lys Leu Val Lys Asp
860 865 870

Gly Tyr Gln Met Ala Gln Pro Ala Phe Ala Pro Lys Asn Ile
875 880 885

Tyr Ser Ile Met Gln Ala Cys Trp Ala Leu Glu Pro Thr His
890 895

Arg Pro Thr Phe Gln Gln Ile Cys Ser Phe Leu Gln Glu Gln
900 905 910

Ala Gln Glu Asp Arg Arg Glu Arg Asp Tyr Thr Asn Leu Pro
915 920 925

Ser Ser Ser Arg Ser Gly Gly Ser Gly Ser Ser Ser Ser Glu
930 935 940

Leu Glu Glu Glu Ser Ser Ser Glu His Leu Thr Cys Cys Glu
945 950 955

Gln Gly Asp Ile Ala Gln Pro Leu Leu Gln Pro Asn Asn Tyr
960 965

Gln Phe Cys
970



(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 566 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(ix) FEATURE:
(A) NAME/KEY: PDGF receptor
(B) LOCATION: Amino acids 522-1087

(x) PUBLICATION INFORMATION:
(A) AUTHORS: Gronwald, R., et al.
(B) JOURNAL: Proc. Natl. Acad. Sci.
(C) VOLUME: 85
(D) PAGES: 3435-3439
(E) DATE: 1988

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:

Met Leu Trp Gln Lys Lys Pro Arg Tyr Glu Ile Arg Trp Lys
522 525 530 535

Val Ile Glu Ser Val Ser Ser Asp Gly His Glu Tyr Ile Tyr
540 545

Val Asp Pro Val Gln Leu Pro Tyr Asp Ser Thr Trp Glu Leu
550 555 560

Pro Arg Asp Gln Leu Val Leu Gly Arg Thr Leu Gly Ser Gly
565 570 575

Ala Phe Gly Gln Val Val Glu Ala Thr Ala His Gly Leu Ser
580 585 590

His Ser Gln Ala Thr Met Lys Val Ala Val Lys Met Leu Lys
595 600 605

Ser Thr Ala Arg Ser Ser Glu Lys Gln Ser Leu Met Ser Glu
610 615

Leu Lys Ile Met Ser His Leu Gly Pro His Leu Asn Val Val
620 625 630



Asn Leu Leu Gly Ala Cys Thr Lys Gly Gly Pro Ile Tyr Ile
635 640 645

Ile Thr Glu Tyr Cys Arg Tyr Gly Asp Leu Val Asp Tyr Leu
650 655 660

His Arg Asn Lys His Thr Phe Leu Gln Arg His Ser Asn Lys
665 670 675

His Cys Pro Pro Ser Ala Glu Leu Tyr Ser Asn Ala Leu Pro
680 685

Val Gly Phe Ser Leu Pro Ser His Leu Asn Leu Thr Gly Glu
690 695 700

Ser Asp Gly Gly Tyr Met Asp Met Ser Lys Asp Glu Ser Ile
705 710 715

Asp Tyr Val Pro Met Leu Asp Met Lys Gly Asp Ile Lys Tyr
720 725 730

Ala Asp Ile Glu Ser Pro Ser Tyr Met Ala Pro Tyr Asp Asn
735 740 745 

Tyr Val Pro Ser Ala Pro Glu Arg Thr Tyr Arg Ala Thr Leu
750 755 



Ile Asn Asp Ser Pro Val Leu Ser Tyr Thr Asp Leu Val Gly
760 765 770

Phe Ser Tyr Gln Val Ala Asn Gly Met Asp Phe Leu Ala Ser
775 780 785

Lys Asn Cys Val His Arg Asp Leu Ala Ala Arg Asn Val Leu
790 795 800

Ile Cys Glu Gly Lys Leu Val Lys Ile Cys Asp Phe Gly Phe
805 810 815

Ala Arg Asp Ile Met Arg Asp Ser Asn Tyr Ile Ser Lys Gly
820 825

Ser Thr Tyr Leu Pro Leu Lys Trp Met Ala Pro Glu Ser Ile
830 835 840

Phe Asn Ser Leu Tyr Thr Thr Leu Ser Asp Val Trp Ser Phe
845 850 855

Gly Ile Leu Leu Trp Glu Ile Phe Thr Leu Gly Gly Thr Pro
860 865 870

Tyr Pro Glu Leu Pro Met Asn Asp Gln Phe Tyr Asn Ala Ile
875 880 885

Lys Arg Gly Tyr Arg Met Ala Gln Pro Ala His Ala Ser Asp
890 895

Glu Ile Tyr Glu Ile Met Gln Lys Cys Trp Glu Glu Lys Phe
900 905 910

Glu Thr Arg Pro Pro Phe Ser Gln Leu Val Leu Leu Leu Glu
915 920 925

Arg Leu Leu Gly Glu Gly Tyr Lys Lys Lys Tyr Gln Gln Val
930 935 940

Asp Glu Glu Phe Leu Arg Ser Asp His Pro Ala Ile Leu Arg
945 950 955

Ser Gln Ala Arg Phe Pro Gly Ile His Ser Leu Arg Ser Pro
960 965

Leu Asp Thr Ser Ser Val Leu Tyr Thr Ala Val Gln Pro Asn
970 975 980

Glu Ser Asp Asn Asp Tyr Ile Ile Pro Leu Pro Asp Pro Lys
985 990 995

Pro Asp Val Ala Asp Glu Gly Leu Pro Glu Gly Ser Pro Ser
1000 1005 1010

Leu Ala Ser Ser Thr Leu Asn Glu Val Asn Thr Ser Ser Thr
1015 1020 1025

Ile Ser Cys Asp Ser Pro Leu Glu Leu Gln Glu Glu Pro Gln
1030 1035

Gln Ala Glu Pro Glu Ala Gln Leu Glu Gln Pro Gln Asp Ser
1040 1045 1050

Gly Cys Pro Gly Pro Leu Ala Glu Ala Glu Asp Ser Phe Leu
1055 1060 1065

Glu Gln Pro Gln Asp Ser Gly Cys Pro Gly Pro Leu Ala Glu
1070 1075 1080

Ala Glu Asp Ser Phe Leu
1085

(2) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 16 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

TCGACGCGCG ATG GAG 16

We claim:

1. An isolated DNA sequence encoding the
Kinase insert Domain containing Receptor.

2. The DNA sequence of Claim 1 wherein said
sequence is a human gene.

3. An isolated DNA sequence comprising a DNA
sequence capable of hybridizing with a DNA sequence
encoding the amino acid sequence of Figure 7.

4. A method for the production of a growth
factor receptor which comprises transforming a host
cell with the DNA sequence of Claim 3 and culturing the
host cell under conditions which result in expression
of the gene by an expression vector.

5. The method of Claim 4 wherein the host
cell is a bacteria, virus, yeast, insect or mammalian
cell line.

6. The method of Claim 5 wherein the host
cell is a COS-1 cell, NIH3T3 fibroblast or CMT-3 monkey
kidney cell.

7. The method of Claim 5 where the
expression vector is pcDNAltkpASP expression vector.

8. A lambda gt11 phage harboring the clone
BTIII081.8 (ATCC accession number 40,931) or the clone
BTIII129.5 (ATCC accession number 40,975).

9. A plasmid pBlueScript KS which contains
the clone BTIV169 (ATCC accession number 75200).

10. An isolated growth factor receptor
designated the Kinase insert Domain containing
Receptor.

11. The receptor of Claim 10 comprising the
amino acid sequence of Figure 7.

12. The receptor of Claim 10 encoded by an
isolated DNA sequence comprising a DNA sequence capable
of hybridizing with a DNA sequence encoding the amino
acid sequence of Figure 7.

13. A biologically active protein fragment
which retains the receptor activity of the receptor of
Claim 10.

14. An isolated DNA sequence encoding a
biologically active protein fragment which retains the
receptor activity of an isolated growth factor receptor
designated the Kinase insert Domain containing
Receptor.

15. An oligonucleotide primer consisting of
an oligonucleotide primer having 21 bases and having
a sequence depicted for Primer 1 in Figure 2.

16. An oligonucleotide primer consisting of
an oligonucleotide primer having 29 bases and having a
sequence depicted for Primer 2 in Figure 2.

17. The 363 base pair product having the 
sequence depicted in Figure 4, or a biological 
equivalent of said sequence. 

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